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CIXL2: A Crossover Operator for Evolutionary Algorithms 
Based on Population Features 



In this paper we propose a crossover operator for evolutionary algorithms with real 
values that is based on the statistical theory of population distributions. The operator is 
based on the theoretical distribution of the values of the genes of the best individuals in 
the population. The proposed operator takes into account the localization and dispersion 
features of the best individuals of the population with the objective that these features 
would be inherited by the offspring. Our aim is the optimization of the balance between 
exploration and exploitation in the search process. 

In order to test the efficiency and robustness of this crossover, we have used a set of 
fimctions to be optimized with regard to different criteria, such as, multimodality, sep- 
arability, regularity and epistasis. With this set of functions we can extract conclusions 
in function of the problem at hand. We analyze the results using ANOVA and multiple 
comparison statistical tests. 

As an example of how our crossover can be used to solve artificial intelligence problems, 
we have applied the proposed model to the problem of obtaining the weight of each network 
in a ensemble of neural networks. The results obtained are above the performance of 
standard methods. 

1. Introduction 

Evolutionary algorithms (EAs) are general purpose searching methods. The selection pro- 
cess and the crossover and mutation operators establish a balance between the exploration 
and exploitation of the search space which is very adequate for a wide variety of problems 
whose solution presents difficulties that are insolvable using classical methods. Most of 
these problems arc defined in continuous domains, so the evolutionary algorithms applied 
use real values, namely, evolution strategies (EPs), real-coded genetic algorithms (RCGAs), 
and evolutionary programming (EP). For these paradigms the precision of the solution does 
not depend on the coding system, as in binary coded genetic algorithms, but on the precision 
of the computer system where the algorithms are run. 

The selection process drives the searching towards the regions of the best individuals. 
The mutation operator randomly modifies, with a given probability, one or more genes of a 
chromosome, thus increasing the structural diversity of the population. As we can see, it is 
clearly an exploration operator, that helps to recover the genetic diversity lost during the 
selection phase and to explore new solutions avoiding premature convergence. In this way, 
the probability of reaching a given point in the search space is never zero. This operator. 
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Figure 1: (a) Hypercube defined by the first two genes of the parents; (b) Representation 
of the segment defined by the ith genes of two chromosomes. 



in fact, implements a random search whose well-studied features are useful in the field of 

evolutionary computation. 

The crossover operator combines the genes of two or more parents to generate better 
offspring. It is based on the idea that the exchange of information between good chro- 
mosomes will generate even better offspring. The effect of the crossover operator can be 
studied from two different points of view: at chromosome level and at gene level. The effect 
of the crossover operator at chromosome level can be considered in a geometric way. Given 
two parents = {/Si,/?!} ^-^d = {/^i,/?!} with two genes, we denote by -f^/ji/jz the 
hypercube defined by their genes (Figure la). At gene level the representation would be 
linear, defining in this case a segment or interval S,oi ,02 for each pair of genes (Figure lb). 
Most crossover operators generate individuals in the exploitation zones, >S'^i^^2 or Hpif^2. 
In this way, the crossover operator implements a depth search or exploitation, leaving the 
breadth search or exploration for the mutation operator. 

This policy, intuitively very natural, makes the population converge to values within 
the hypercubes defined by their parents, producing a rapid decrease in the population 
diversity which could end up in a premature convergence to a non-optimal solution. Recent 
studies on BLX-a crossover (Eshelman & Schaffer, 1993), the crossover based on fuzzy 
connectives (Hcrrcra, Hcrrcra-Vicdma, Lozano, & Vcrdcgay, 1994), and fuzzy recombination 
(Voigt, Miihlenbein, &; Cvetkovic, 1995), have confirmed the good performance of those 
crossover operators that also generate individuals in the exploration zone. These operators 
avoid the loss of diversity and the premature convergence to inner points of the search 
space, but also the generation of new individuals in the exploration zone could slow the 
search process. For this reason, the crossover operator should establish an adequate balance 
between exploration (or interpolation) and exploitation (or extrapolation), and generate 
offspring in the exploration and exploitation zones in the correct proportion. 

Establishing a balance between exploration and exploitation is important, but it is also 
important that such a balance is self-adaptive (Kita, 2001; Beyer &: Deb, 2001; Deb & 
Beyer, 2001), that is, it must guarantee that the dispersion of the offspring depends on 
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the dispersion of the parents. So, two close parents must generate close offspring, and two 
distant parents must generate distant offspring. The control of dispersion in the crossover 
based on fuzzy connectives is based on the generation of offspring using the fuzzy connec- 
tives i-norms, t-conorms, average functions, and a generalized operator of compensation 
(Mizumoto, 1989). In fuzzy recombination the offspring is generated using two triangular 
distributions whose averages derive from each of the genes of the two parents. In BLX-a 
we have the same probability of generating an offspring between the parents, and in an area 
close to the parents whose amplitude is modulated by the a parameter. 

Ono and Kobayashi (1997) have proposed a Unimodal Normally Distributed Crossover 
(UNDX), where three parents arc used to generate two or more children. The children are 
obtained using an ellipsoidal distribution where one axis is the segment that joins the two 
parents and the extent of the orthogonal direction is decided by the perpendicular distance 
of the third parent from the axis. The authors claim that this operator should preserve the 
statistics of the population. This crossover is also self-adaptive, but it differs from BLX-a 
by the fact that it is more probable to generate offspring near the average of the first two 
parents. 

Another self-adaptive crossover is the Simulated Binary Crossover (SBX) (Deb & Agrawal, 
1995). Based on the search features of the single-point crossover used in binary-coded ge- 
netic algorithms, this operator respects the interval schemata processing, in the sense that 
common interval schemata of the parents are preserved in the offspring. The SBX crossover 
puts the stress on generating offspring near the parents. So, the crossover guarantees that 
the extent of the children is proportional to the extent of the parents, and also favors 
that near parent individuals are monotonically more likely to be chosen as children than 
individuals distant from the parents. 

The main goal of this paper is to propose a crossover operator that avoids the loss 
of diversity of the population of individuals, and, at the same time, favors the speed of 
convergence of the algorithm. These two goals are, at first, conflicting; their adequate 
balance is controlled by two of the basic features of the crossover operator: i) the balance 
between exploration and exploitation and, ii) the self-adaptive component. These two 
features make the evolutionary algorithms avoid premature convergence and favor local 
fine-tuning. Both attributes are highly appreciated in any search algorithm. 

In most current crossover operators, the features of the offspring depend on the features 

of just a few parents. These crossovers do not take into account population features such 
as localization and dispersion of the individuals. The use of these statistical features of the 
population may help the convergence of the population towards the global optimum. 

The crossover operator implements basically a depth or exploitative search, just like 
other methods such as steepest gradient descent, local search or simulated annealing, but 
in these three search methods the algorithm takes the quality of the solutions into account. 
So, it is reasonable to think that it is also convenient for the crossover operator to consider 
the performance on the individuals involved in the crossover operation. This idea is already 
implemented by some heuristic crossovers (Wright, 1991). 

Nevertheless, following the previous line of argument, it seems rather poor to use just 
two parents, and not to consider the most promising directions towards which it would 
be advisable to drive the search. That is, instead of using a local heuristic that uses two 
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individuals, involving the whole population or an adequate subset in the determination of 
the direction of the search whose features would be specially suitable. 

Motivated by this line of argument, in this paper we propose a crossover operator, which 
will be called Confidence Interval Based Crossover using L2 Norm (CIXL2). On the one 
hand, it takes advantage of the selective component that is derived from the extraction of 
the features of the best n individuals of the population and that indicates the direction of 
the search, and on the other hand, it makes a self-adaptive sampling around those features 
whose width depends on the number of best individuals, dispersion of those best individuals, 
confidence coefficient, and localization of the individuals that participate in the crossover. 
Now, the exploitation region is not the area between the two parents that arc involved 
in the crossover, but the area defined by the confidence interval built from the n best 
individuals of the population; and the exploratory region is the rest of the search domain. 
To the previous concepts of exploration and exploitation, merely geometrical, is added a 
probabilistic component that depends on the population features of the best individuals. 

Estimation of Distribution Algorithms (EDAs) or Probabilistic Model-Building Evolu- 
tionary Algorithms (Miihlenbein & Paa/3, 1998; Miihlenbein, Mahnig, & Rodriguez, 1999) 
are based on a, seemingly, similar idea. These algorithms do not have mutation and crossover 
operators. After every generation the population distribution of the selected individuals is 
estimated and the new individuals are obtained sampling this estimated distribution. How- 
ever, the underlying idea behind our crossover is the extraction of population features, mean 
and standard deviation, in order to detect the regions where there is a higher probability 
of getting the best individuals. In order to perform the crossover, we create three virtual 
parents that represent the localization estimator mean, and the bounds of the confidence 
interval from which, with a certain confidence degree, this localization estimator takes the 
values. In this way, the children generated from these three parents will inherit the features 
of the best individuals of the population. 

The rest of the paper is organized as follows: Section 2 explains the definition of CIXL2 
and its features; Section 3 discusses the problem of the selection of the test sets, and 
justifies the use of a test set based on the one proposed by Eiben and Back (1997a); Section 
4 describes the experimental setup of evolutionary algorithm (RCGA) used in the tests; 
Section 5 studies the optimal values of the parameters of CIXL2; Section 6 compares the 
performance of CIXL2 against other crossovers; Section 7 compares CIXL2 with EDAs; 
Section 8 describes the application of RCGAs with CIXL2 to neural network ensembles; 
and, finally. Section 9 states the conclusions of our paper and the future research lines. 

2. CIXL2 Operator 

In this section we will explain the theoretical base that supports the defined crossover 
operator, and then we will define the crossover. We will use an example to explain the 
dynamics of a population subject to this crossover operator. 

2.1 Theoretical Foundation 

In this section we will study the distribution of the i-th. gene and the construction of a 
confidence interval for to the localization parameter associated with that distribution. 
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Let /3 be the set of A'^ individuals with p genes that make up the population and /3* C 
(3 the set of the best n individuals. If we assume that the genes (3* of the individuals 
belonging to (5* are independent random variables with a continuous distribution H{j3*) 
with a localization parameter fip* , we can define the model 

Pi = IJ-P* + ei, for i = 1, ...,p, (1) 

being Cj a random variable. If we suppose that, for each gene i, the best n individuals form 
a random sample {P*i,P*2, Ptn} distribution of /?*, then the model takes the form 

Pij = /^/3* + eij, for i = 1, ...,p and j = 1, ...,n. (2) 

Using this model, wc analyze an estimator of the localization parameter for the i-th 
gene based on the minimization of the dispersion function induced by the L2 norm. The L2 
norm is defined as 



pi II 2 



Y.(^^J)^ (3) 

i=i 



hence the associated dispersion induced by the L2 norm in the model 2 is 

n 

^2(M/3?) = E(A^-m*)'' (4) 
and the estimator of the localization parameter np* is: 

n 

A/3* = arg min D2 ) = arg min ^ - np* f. (5) 
Using for minimization the steepest gradient descent method, 

dD2 (UR* ) 



we obtain 



n 



52(m/3*) = 2^(/3*,-/.^*), (7) 



and making (7) equal to yields 



MA* = Pi ■ (8) 



n 



So, the estimator of the localization parameter for the z-th gene based on the minimiza- 
tion of the dispersion function induced by the L2 norm is the mean of the distribution of 
(3* (KendaU & Stuart, 1977), that is, /i^* = 
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The sample mean estimator is a linear estimator^, so it has the properties of unbi- 
asedness^ and consistency^, and it follows a normal distribution N{nfs*,a'^,/n) when the 
distribution of the genes H{(3*) is normal. Under this hypothesis, we construct a bilateral 
confidence interval for the localization of the genes of the best n individuals, using the 
studentization method, the mean as the localization parameter,and the standard deviation 
Sj3* as the dispersion parameter: 



tCI 



Sa* - Sn* 

+ _Jli. R* \ + 

'^n-l,a/2~7=)Pi "r '^n-l,a/2~7= 



(9) 



where tn_i,Q/2 is the value of Student's t distribution with n — 1 degrees of freedom, and 
1 — a is the confidence coefficient, that is, the probability that the interval contains the true 
value of the population mean. 



2.2 CIXL2 Definition 

Prom this definition of the confidence interval, we define three intervals to create three "vir- 
tual" parents, formed by the lower limits of the confidence interval of each gene, CILL^, the 
upper limits, CIUL^, and the means CIM^. These parents have the statistical information 
of the localization features and dispersion of the best individuals of the population, that is, 
the genetic information the fittest individuals share. Their definition is: 



where 



CILL = {CILLi,...,CILLi,...CILLp) 
CIUL = {CIULi,...,CIULi,...CIULp) 
CIM = {CIMi,...,CIMi,...CIMp), 



CILLi = /?j — in-l,a/2^= 



(10) 



n 



(11) 



CIULi 
CI Mi = A. 



Pi + tn-l,a/2— 7= 



The CILL and CIU L individuals divide the domain of each gene into three subintervals: 
Di = I^yj If I U /f , where = [ai,CILLi); if^ = {CILLi, CIULi]; if = {CIULi,bi]; 
being and bi the bounds of the domain (see Figure 2). 

The crossover operator creates one offspring from an individual of the population 
G /3, randomly selected, and one of the individuals CILL, CIUL or CIM, depending 
on the localization of (3^ , as follows: 



1. It is a linear combination of the sample values. 

2. An estimator is an unbiased estimator of 9 if the expected value of the estimator is the parameter to 

be estimate: E[9] = 6. 

3. A consistent estimator is an estimator that converges in probability to the quantity being estimated as 
the sample size grows. 

4. Confidence Interval Lower Limit. 

5. Confidence Interval Upper Limit. 

6. Confidence Interval Mean. 
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D. 



CILL. ^^"^i ClUL. b. 



Figure 2: An example of confidence interval based crossover 

• f3{ ii the fitness of is higher than CILL, then Pf = r(/3/ - CILLi) + /?/, else 
Pt =riCILLi-(3{) + CILLi. 

• P{ G /P: if the fitness of P^ is higher than CIM, then P? = r{p{ - CIMi) + /?/, else 
Pt = r{CIMi-pl) + CIMi. 

• P{ e If: if the fitness of P^ is higher than CIUL, then P? = r{p{ - CIULi) + p{ , else 
Pt = r{CIULi -P() + CIULi (this case can be seen in Figure 2). 

where r is a random number in the interval [0, 1]. 

With this definition, the offspring always takes values in the direction of the best of the 
two parents but never between them. If the virtual individual is one of the bounds of the 
confidence interval and is better than the other parent, the offspring is generated in the 
direction of the confidence interval where it is more likely to generate better individuals. 
If the virtual individual is worse than the other parent, the offspring is generated near the 
other parent in the opposite direction of the confidence interval. On the other hand, if a 
parent selected from the population is within the confidence interval, the offspring can be 
outside the interval - but always in its neighborhood - if the fitness of the center of the 
confidence interval is worse. This formulation tries to avoid a shifting of the population 
towards the confidence interval, unless this shifting means a real improvement of the fitness 
in the population. 

If P^ is distant from the other parent, the offspring will probably undergo a marked 
change, and if both parents are close, the change will be small. The first circumstance will 
be likely to occur in the first stages of the evolutionary process, and the second one in the 
final stages. 

The width of the interval l'^^ depends on the confidence coefficient, 1 — a, the number 
of best individuals, n, and the dispersion of the best individuals. In the first stages of the 
evolution, the dispersion will be large, specially for multimodal functions, and will decrease 
together with the convergence of the genetic algorithm. These features allow the balance 
between exploitation and exploration to adjust itself dynamically. The crossover will be 
more exploratory at the beginning of the evolution, avoiding a premature convergence, and 
more exploitative at the end, allowing a fine tuning. The parameters n and 1—a regulate the 
dynamics of the balance favoring a higher or lower degree of exploitation. That suggests the 
CIXL2 establishes a self-adaptive equilibrium between exploration and exploitation based 
on the features that share, with a certain confidence degree 1 — a, the best n individuals 
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Figure 3: Effect of the CIXL2 crossover over a population used for the minimization of the 
Rosenbrock function with two variables 



of the population. A preliminary theoretical study of this aspect is carried out by Hervas- 
Martmez and Ortiz-Boyer (2005). 

2.3 Crossover Dynamics 

Figure 3 shows a simulation of the behavior of the crossover for the optimization of Rosen- 
brock function (Eiben & Back, 1997b) with two variables. On Figure 3a, we observe how 
most of the individuals are within the domain D^^^; while the best n are within the confi- 
dence domain D^^ = I^^ x I^^^. D^^ is shifted towards the minimum of the function placed 
in (1, 1), the domain D'-^^^ of the new population, generated after applying CIXL2, will be 
shifted to the optimum. This displacement will be higher in the first stages of evolution, 
and will decrease during evolution. It may be modulated by the parameters n and 1 — a. 

Figure 3a shows how the population, after applying the crossover operator, is distributed 
in a region nearer the optimum whose diversity depends on the parameters of the operator. 

Figure 3b shows how the whole population and the n best individuals are distributed. As 
we can see, the distribution of the best n individuals keeps the features of the distribution of 
the population, but it is shifted to the optimum. The shifting towards the optimum will be 
more marked if the value of n is small. The tails of the distribution of the best individuals 
will be larger if the dispersion of the best individuals is also large, and smaller if they are 
concentrated in a narrow region. The size of these tails also depends on the features of the 
problem, the stage of the evolution, and the particular gene considered. The effect of the 
crossover on the distribution of the population is to shift the distribution towards the best 
n individuals and to stretch the distribution modulately depending on the amplitude of the 
confidence interval. The parameters n and 1 — a are responsible for the displacement and 
the stretching of the region where the new individuals will be generated. 

If n is small, the population will move to the most promising individuals quickly. This 
may be convenient for increasing the convergence speed in unimodal functions. Nevertheless, 
it can produce a premature convergence to suboptimal values in multimodal functions. If 
n is large, both the shifting and the speed of convergence will be smaller. However, the 
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evolutionary process will be more robust, this feature being perfectly adequate for the 
optimization of multimodal, non-separable, highly epistatic functions. 

The parameter n is responsible for the selectiveness of the crossover, as it determines 
the region where the search will be directed. The selection is regulated by the parameter 
1 — a. This parameter bounds the error margin of the crossover operator in order to obtain 
a search direction from the feature that shares the best individuals of the population. 

3. Benchmark Problems 

In the field of evolutionary computation, it is common to compare different algorithms using 
a large test set, especially when the test involves function optimization (Gordon & Whitley, 
1993). However, the effectiveness of an algorithm against another algorithm cannot be 
measured by the number of problems that it solves better. The "no free lunch" theorem 
(Wolpcrt & Macrcady, 1995) shows that, if we compare two searching algorithms with all 
possible functions, the performance of any two algorithms will be , on average, the same 
. As a result, attempting to design a perfect test set where all the functions are present 
in order to determine whether an algorithm is better than another for every function, is a 
fruitless task. 

That is the reason why, when an algorithm is evaluated, we must look for the kind 
of problems where its performance is good, in order to characterize the type of problems 
for which the algorithm is suitable. In this way, we have made a previous study of the 
functions to be optimized for constructing a test set with fewer functions and a better 
selection (Whitley, Mathias, Rana, & Dzubcra, 1995; Salomon, 1996). This allows us to 
obtain conclusions of the performance of the algorithm depending on the type of function. 

Taking into account this reasoning, the test set designed by Eiben and Back (1997b) is 
very adequate. The test set has several well characterized functions that will allow us to 
obtain and generalize, as far as possible, the results regarding the kind of function involved. 
Nevertheless, we have added two functions to the test set with the aim of balancing the 
number of functions of each kind. These two new functions are the function of Rosenbrock 
(Rosenbrock, 1960) extended to p dimensions and the function of Schwefel (Schwefel, 1981); 
both of them have been widely used in evolutive optimization literature. Table 1 shows the 
expression of each function and a summary of its features: separability, multimodality, and 
regularity. 

A function is multimodal if it has two or more local optima. A function of p variables is 
separable if it can be rewritten as a sum of p functions of just one variable (Hadley, 1964). 
The separability is closely related to the concept of epistasis or interrelation among the 
variables of the function. In the field of evolutionary computation, the epistasis measures 
how much the contribution of a gene to the fitness of the individual depends on the values 
of other genes. 

Non separable functions are more difficult to optimize as the accurate search direction 
depends on two or more genes. On the other hand, separable functions can be optimized for 
each variable in turn. The problem is even more difficult if the function is also multimodal. 
The search process must be able to avoid the regions around local minima in order to 
approximate, as far as possible, the global optimum. The most complex case appears when 
the local optima are randomly distributed in the search space. 



9 



ORTIZ-BOYER, HERVAS-MARTI'nEZ, & GARCfA-PEDRAJAS 



Function Definition Multimodal? Separable? Hegular? 



Sphere 


e [-5.12, 5.12] 
X* ^ (0, 0, . . . , 0); fq„i,(x*) ^ 


no 


yes 


n/a 


Schwefel's 


/SchDs(x) = E?=i (E}=i 


no 


no 


n/a 


double sum 


aii e [-65.536, 65.536] 

X* = (0,0, . .. ,0); fschosi^") = 








Rosenbrock 


/ros W = Epi'[100{xi+i - x^if + (Xi - if] 

Xi e [-2.048, 2.048] 

X* =(1,1, /bos(^*) = 


no 


no 


n/a 


Rastrigin 


/flasW = lOp + ET-iCi!? - 10coB(27rXi)) 

^ [—5.12, 5.12] 
X* = (0, 0, .'. . , 0); /Has{x') = 


yes 


yes 


n/a 


Schwefel 


/Sch W = 418.9829 ■ P + ELl ^i"*" (\/F7T) 

S [-612.03, 611.97] 
X* = (-420.9687, . . . , -420.9687); fschi^*) = 


yes 


yes 


n/a 




f A 1 (-x^ — on 4- f> — OC^P'rr, I —C\ / ^ S^P '7-2 \ _ 

JAckK^} — .^u i- e zuexp i ^-^y p 2^ i=i J 
exp Ei'=i cos(2ira;i)^ 
6 [-30, 30] 
X* =(0,0, ...,0); /Acfc(x*) = 


yes 




yes 


Griewangk 


/g.,(x) = 1 + ELl 40fe - COS (^) 

Xi 6 [-600, 600] 

x*(0,0, ...,0); /gw(x*) =0 


yes 


no 


yes 


Fletcher 


/FieW=EL,{A, -i3i)2 


yes 


no 


no 


Powell 


= ^._^{aij sin(Xj -\- b^j costy.j) 
Bi = ^ ^_ -j^ (a jj sinxj -j- hijcosxj) 
Xi,ai £ [-7r,7r]; a^jj^ij £ [-100.100] 
X* ^ a; /^;^(x*) = 








Langerman 


cos (7rEf=i(^j - ^ij)"^) 
Xi e [0, 10]; m = p 
X* = random; fLani^*) — random 


yes 


no 


no 



Table 1: Definition of each function together with its features 



The dimensionality of the search space is another important factor in the complexity 
of the problem. A study of the dimensionality problem and its features was carried out by 
Friedman (1994). In order to establish the same degree of difficulty in all the problems, we 
have chosen a search space of dimensionality p = 30 for all the functions. 

Sphere function has been used in the development of the theory of evolutionary strategics 
(Rechenberg, 1973), and in the evaluation of genetic algorithms as part of the test set 
proposed by De Jong (1975). Sphere, or De Jong's function Fl, is a simple and strongly 
convex function. Schwefel's double sum function was proposed by Schwefel (1995). Its main 
difficulty is that its gradient is not oriented along their axis due to the cpistasis among their 
variables; in this way, the algorithms that use the gradient converge very slowly. Rosenbrock 
function (Rosenbrock, 1960), or De Jong's function F2, is a two dimensional function with 
a deep valley with the shape of a parabola of the form = X2 that leads to the global 
minimum. Due to the non-linearity of the valley, many algorithms converge slowly because 
they change the direction of the search repeatedly. The extended version of this function 
was proposed by Spedicato (1975). Other versions have been proposed (Oren, 1974; Dixon, 
1974). It is considered by many authors as a challenge for any optimization algorithm 
(Schlierkamp-Voosen, 1994). Its difficulty is mainly due to the non- linear interaction among 
its variables. 

Rastrigin function (Rastrigin, 1974) was constructed from Sphere adding a modulator 
term a ■ cos{2iTXi). Its contour is made up of a large number of local minima whose value 
increases with the distance to the global minimum. The surface of Schwefel function (Schwe- 
fel, 1981) is composed of a great number of peaks and valleys. The function has a second 
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best minimum far from the global minimum where many search algorithms are trapped. 
Moreover, the global minimum is near the bounds of the domain. 

Ackley, originally proposed by Ackley (1987) and generalized by Back (1993), has an 
exponential term that covers its surface with numerous local minima. The complexity of 
this function is moderated. An algorithm that only uses the gradient steepest descent 
will be trapped in a local optima, but any search strategy that analyzes a wider region 
will be able to cross the valley among the optima and achieve better results. In order to 
obtain good results for this function, the search strategy must combine the exploratory and 
exploitative components efficiently. Griewangk function (Back, Fogel, k, Michalewicz, 1997) 
has a product term that introduces interdependence among the variables. The aim is the 
failure of the techniques that optimize each variable independently. As in Ackley function, 
the optima of Griewangk function are regularly distributed. 

The functions of Fletcher-Powell (Fletcher &: Powell, 1963) and Langerman (Bersini, 
Dorigo, Langerman, Seront, & Gambardella, 1996) are highly multimodal, as Ackley and 
Griewangk, but they are non-symmetrical and their local optima are randomly distributed. 
In this way, the objective function has no implicit symmetry advantages that might simplify 
optimization for certain algorithms. Fletcher-Powel function achieves the random distribu- 
tion of the optima choosing the values of the matrixes a and b, and of the vector a at 
random. We have used the values provided by Back (1996). For Langerman function, we 
have used the values of a and c referenced by Eiben and Back (1997b). 

4. Evolutionciry Algorithm 

The most suitable evolutionary algorithms to solve optimization problems in continuous 
domains are evolutionary strategics (Schwefel, 1981; Rechenberg, 1973), genetic algorithms 
(Holland, 1975; Goldberg, 1989a) with real coding (Goldberg, 1991) and evolutionary pro- 
gramming (Fogel, Owens, & Walsh, 1966; Fogel, 1995). For evaluating CIXL2 we have 
chosen real coded genetic algorithms, because they are search algorithms of general pur- 
pose where the crossover operator plays a central role. The general structure of the genetic 
algorithm is shown in Figure 4. 

Nevertheless, CIXL2 could be applied to any evolutionary algorithms with a crossover 
or similar operator. On the other hand, the real codification is the most natural one in 
continuous domains, each gene representing a variable of the function. In this way, the 
precision of the solution only depends on the data type used to store the variables. 

Our objective is the comparison of the behavior of the proposed crossover against other 
crossovers. This comparison must be made in a common evolutionary framework that is 
defined by the features of the genetic algorithm. For the definition of such features, we have 
taken into account the previous studies on the matter. In the following paragraphs we will 
describe in depth the different components of our genetic algorithm. 

4.1 Structure of the Individual and Population Size 

Each individual is made up of p = 30 genes, the dimensionality of the functions to optimize. 

The size of the population is one of the critical parameters for many applications. If 
the size of the population is too small, the algorithm could converge quickly towards sub- 
optimal solutions; if it is too large, too much time and resources could be wasted. It is also 
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Genetic algorithm 
begin 

initialize (3{t) 
evaluate (3{t) 

while (not stop-criterion) do 
begin 

select l3{t) from /3(t - 1) 
crossover /3(t) 
mutate (3{t) 
evaluate P{t) 
end 

end 



Figure 4: Structure of the genetic algorithm, t is the current generation. 



obvious that the size of the population, together with the selective pressure, influences the 

diversity of the population. 

Several researches have studied these problems from different points of view. Grefen- 
stette (1986) used a meta-genetic algorithm for controlling the parameters of another genetic 
algorithm, such as population size and the selection method. Goldberg (1989b) made a the- 
oretical analysis of the optimum population size. A study of the influence of the parameters 
on the search process was carried out by Schaffer, Caruana, Eshelman and Das (1989). 
Smith (1993) proposed an algorithm that adjusts the size of the population with respect to 
the error probability of the selection . Another method consists of changing the size of the 
population (Arabas, Michalewicz, & Mulawka, 1994) dynamically. 

The size of the population is usually chosen in an interval between 50 and 500 individuals, 
depending on the difficulty of the problem. As a general practice, in function optimization, 
the size is in the interval [50, 100] for unimodal functions, and in the interval [100, 500] 
for multimodal functions. However, several papers use a compromise size of 100 for all the 
functions in order to homogenize the comparison environment. We will also use a population 
size of 100 individuals like other comparative studies (Zhang &; Kim, 2000; Takahashi, Kita, 
& Kobayashi, 1999). 

4.2 Selection 

Zhang and Kim (2000) a comparative study was carried out of the performance of four 
selection methods: proportional, ranking, tournament and Genitor. In contrast to other 
studies that are based on an asymptotic study under more or less ideal conditions, this 
paper is devoted to a practical case, the problem of machine layout. The paper analyzes 
the quality of the sohitions obtained in a reasonable amount of time and using mutation 
and crossover operators. The study concludes that the methods of ranking and tournament 
selection obtain better results than the methods of proportional and Genitor selection. 
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We have chosen the binary tournament selection, against the ranking selection, used by 
Zhang and Kim (2000) for two reasons: 

• The complexity of the tournament selection is lower than the complexity of the ranking 
selection (Back, 1996). 

• The selective pressure is higher. This feature allows us to measure whether each 
crossover is able to keep the population diversity (Goldberg & Deb, 1991). 

Tournament selection runs a tournament between two individuals and selects the winner. 
In order to assure that the best individuals always survive to the next generation, we use 
elitism, the best individual of the population in generation t is always included in the 

population in generation t + 1. It has been proved, both theoretically (Rudolph, 1994) and 
empirically (Back, 1996; Michalewicz, 1992; Zhang & Kim, 2000), the convenience of the 
use of elitism. 

4.3 Population Update Model 

There are different techniques for updating the population, among the most important are 
the generational model and the steady-state model. In the generational model in each 
generation a complete set of N new offspring individuals is created from A'^ parents selected 
from the population. In most such generational models, the tournament selection is used to 
choose two parent individuals, and a crossover with pc probability and a mutation operator 
con pm probability are applied to the parents. 

This contrasts with the steady-state model, where one member of the population is 
replaced at a time. The steady-state model selects an individual to be mutated and the 
mutated individual replaces another individual of the population. For the crossover two 
individuals are selected and one of the offspring replaces one individual of the population. 
There are a number of different replacement strategies: replace- worst, replace a randomly 
chosen member, select replacement using negative fitness. 

The model that extrapolates between generational and steady-state is said to have a 
generation gap G (De Jong, 1975; Jong & Sarma, 1993). Thus for a generational model, 
G = 1; while for a steady-state model, G = 1/N. One of the most widely used variants of 
the steady-stated genetic algorithm is the Minimal Generation Gap (MGG) model (Satoh, 
Yamamura, & Kobayashi, 1996). This model takes two parents randomly from the pop- 
ulation and generates A children. Two individuals are selected from the parents and the 
offspring: the best individual, and another individual chosen by roulette selection. These 
two individuals substitute the parents in the population. 

The generational model is the most frequently used in the comparative studies that use 
BLX, SBX, logical crossover and fuzzy recombination. This is the reason why it will be the 
model used in this paper. However, for UNDX crossover we have used the MGG model, 
because UNDX and MGG are commonly used together and the generational model can 
have a negative influence on the performance of UNDX. 

For the parameters of the two models we have used the most commonly used in the 
literature. For the generational model, we use a probability of crossover of pc = 0.6 (De 
Jong, 1975; Herrera, Lozano, & Verdegay, 1998). For the MGG model we have used A = 200, 
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as this is a value commonly used in papers about UNDX (Ono & Kobayashi, 1997; Ono, 
Kita, & Kobayashi, 1999; Ono, Kobayashi, & Yoshida, 2000). For the mutation probability, 
values in the interval pm G [0.001,0.1] are usual (De Jong, 1975; Herrera et al., 1998; 
Michalewicz, 1992; Back, 1996). We have chosen a value of Pm = 0.05 for both models. 

4.4 Initialization 

In a search algorithm, the initialization method is very important. In many cases the 
initialization determines the success or failure of the search process. We have opted, as in 
other papers (Herrera et al., 1998; De Jong, 1975; Beyer &; Deb, 2001; Herrera, Lozano, 
& Sanchez, 2003), for initializing the values of the genes by means of a uniform random 
distribution within the domain of each variable. 

4.5 Mutation 

As mutation operator we have chosen the non-uniform mutation with parameter b = 5 
(Michalewicz, 1992) as its dynamical nature makes it very suitable for a wide variety of 
problems (Herrera & Lozano, 2000). 

The individuals generated by this mutation are obtained as follows: 



where t is the generation, gmax is the maximum number of generations, r is a random value, 
T G {0, 1}, r is a random number in the interval [0, 1] and 6 is a parameter that determines 

the degree of dependence of the mutation with regards to the number of iterations. Equation 
13 gives values in the interval [0,y]. The probability of obtaining a value near increases 
as the algorithm progresses. This operator performs a uniform search in the initial stages 
of the evolution, and a very localized search in the final stages. 

4.6 Stop Criterion 

The part of the genetic algorithm that takes up most of the time is the evaluation of the 
fitness function. The number of evaluations of the fitness in each generation depends on the 
operators used and the population update model. Different operators and update models 
can lead to very different numbers of evaluations per generation. That is the reason why 
it is common to use the number of evaluations as the stop criterion instead of the number 
of generations. We have used a limit of 300,000 evaluations (Eiben, van der Hauw, & van 
Hemert, 1998; De Jong &; Kosters, 1998) as stop criterion. The precision of the solutions 
is bounded by the precision of the data type used in the implementation of the genetic 
algorithm. We have used a double precision data type of 64 bits following the specification 
ANSI/IEEE STD 754-1985 (IEEE Standard for Binary Floating-Point Arithmetic). This 
data type has a precision of 15 - 17 digits. 




(3i + A{t,bi-(3i) si T = 
/3i - A{t,Pi - Qi) si T = l 



(12) 



being 




(13) 
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5. Analysis of CIXL2 

In this section we will perform an analysis of the crossover, and will obtain for every test 
function the following information: 

1. The optimal value for the confidence coefficient 1 — a of the confidence interval. The 
values used are 1 - a = {0.70, 0.90, 0.95, 0.99}. 

2. The optimal number of best individuals used by the crossover to calculate the confi- 
dence intervals of the mean. The values used are n = {5, 10, 30, 60, 90}. 

These two factors are not independent, so we will perform an analysis using all the 
possible pairs (1 — a, n) oi the Cartesian product of the two sets. For each pair we will 
perform 30 runs of the genetic algorithm with different random seeds. Table 2 shows the 
average value and standard deviation of the 30 runs for each experiment. 

The study of the results has been made by means of an analysis of variance ANOVA 
II (Dunn & Clark, 1974; Miller, 1981; Snedecor & Cochran, 1980), with the fitness of the 
best individuals. A, as test variable. This fitness is obtained independently in 30 runs and 
depending on two fixed factors and their interaction. The fixed factors are: the confidence 
coefficient C with four levels and the number of best individuals B with five levels. The 
linear model has the form: 



Aij = n + Ci + Bj + CBij + dj (14) 
i = 1,2,3,4; and j = 1,2,3,4,5 

where: 

• Cj is the effect over A of the i-th. level of factor C, where Ci represents a confidence 
coefiicient of 0.70, C2 of 0.90, C3 of 0.95 and C4 of 0.99. 

• Bj is the effect over A of the j-th. level of factor B, where Bi represents a value of 
n = 5, B2 of n = 10, S3 of n = 30, of n = 60 and of n = 90. 

• CBij represents the effect of the interaction between the confidence coefficient C and 
the number of best individuals B. 

• /Lt is the global mean of the model. The variation of the experimental results from ^ 
is explained by the effects of the different levels of the factors of the model and their 
interaction. 

• eij are error variables. 

The hypothesis tests try to determine the effect of each term over the fitness of the best 
individuals, A. We have carried out tests for every factor and for the interaction among 
the factors. This and subsequent tests are performed with a confidence level of 95%. The 
coefficient of the linear model tells us the percentage of variance of A that is explained 
by the model. 
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Function 


n 


1 - a 


Mean 


St. Dev. 


1 - a 


Mean 


St. Dev. 


1 - a 


Mean 


D. Tip. 


1 - a 


Mean 


St. Dev. 


Sphere 




0.70 


6.365C-16 


2.456C-16 


0.90 


4.8850-16 


1.969C-16 


0.95 


3.653C-16 


1.710O-16 


0.99 


1.998e-16 


6.775e-17 




10 




5.736C-15 


2.495C-15 




2.5540-15 


8.934e-16 




2.642C-15 


1.2580-15 




1.480O-15 


1.032C-15 


30 




3.728e-12 


1.623e-12 




1.446e-ll 


7.062e-12 




2.279e-ll 


1.256e-ll 




1.248e-10 


5.914e-ll 




60 




6.082e-10 


2.499e-10 




2.867e-08 


1.642e-08 




1.667e-07 


9.911e-08 




5.494e-07 


6.029e-07 




90 




3.838e-09 


2.326e-09 




4.383e-08 


3.068e-08 




6.840e-08 


5.894e-08 




1.061e-07 


8.401e-08 


Schwcfcl's 


5 


0.70 


1.995e-03 


2.280e-03 


0.90 


8.403O-03 


7.748e-03 


0.95 


7.662C-03 


9.693O-03 


0.99 


1.305C-02 


1.303C-02 


double sum 


10 




2.232e-02 


2.859e-02 




5.407e-02 


3.792e-02 




4.168e-02 


4.383e-02 




1.462e-02 


1.422e-02 


fSchDS 


30 




8.464e-02 


1.168e-01 




3.190e-01 


2.798e-01 




2.644e-01 


2.569e-01 




1.223e-01 


9.0186-02 




60 




1.376e-01 


1.202e-01 




4.059e-01 


2.396e-01 




2.223e-01 


1.384e-01 




2.134e-01 


1.464e-01 




90 




8.048O-01 


5.403O-01 




2.257C+00 


1.490O+00 




7.048C-01 


7.689O-01 




2.799O-01 


2.322C-01 


Rosenbrock 


5 


0.70 


2.494e+01 


1.283e+00 


0.90 


2.506e+01 


3.050e-01 


0.95 


2.497e+01 


4.663e-01 


0.99 


2.463e+01 


1.330e+00 


/Ros 


10 




2.679B+01 


2.044e-01 




2.691e+01 


1.324B-01 




2.589e+01 


9.426e-02 




2.579e+01 


1.609B-01 




30 




2.611e+01 


1.471e-01 




2.632e+01 


1.746e-01 




2.642e+01 


1.377e-01 




2.6688+01 


9.999e-02 




60 




2.576e+01 


1.988e-01 




2.693e+01 


2.292e-01 




2.600e+01 


4.045e-01 




2.617e+01 


4.787B-01 




90 




2.662e+01 


2.827e-01 




2.670e+01 


2.974e-01 




2.579e+01 


2.629e-01 




2.585e+01 


3.654e-01 


Rastrigin 


5 


0.70 


2.919e+00 


1.809e+00 


0.90 


6.036e+00 


2.023O+00 


0.95 


7.893O+00 


2.450e+00 


0.99 


7.164C+00 


2.579O+00 


fRae 


10 




6.799C+00 


2.480O+00 




1.068e+01 


3.786O+00 




1.297O+01 


3.844e+00 




1.675e+01 


6.554O+00 




30 




9.452e+00 


2.434e+00 




1.270e+01 


3.522e+00 




1.327e+01 


4.770e+00 




1.552e+01 


3.664e+00 




60 




1.413e+01 


4.126e+00 




1.837e+01 


6.070e+00 




1.499e+01 


4.434e+00 




1.691e+01 


4.123e+00 




90 




1.771e+01 


5.063e+00 




2.438e+01 


7.688e+00 




1.987e+01 


5.637e+00 




2.249e+01 


6.058e+00 


Schwefel 




0.70 


6.410e + 02 


2.544e+02 


0.90 


1.145C+03 


5.422O+02 


0.95 


1.424C+03 


6.837C+02 


0.99 


2.844e+03 


4.168e+02 


fSch 


10 




1.793e+03 


4.172e+02 




1.325e+03 


2.340e+02 




1.486e+03 


2.607e+02 




2.525e+03 


3.069e+02 




30 




2.676e+03 


2.B92e+02 




2.264e+03 


2.7B8e+02 




2.061e+03 


2.369e+02 




1.986e+03 


2.424e+02 




60 




2.700e+03 


1.471e+02 




2.613e+03 


1.927e+02 




2.496e+03 


2.146e+02 




2.169e+03 


2.434e+02 




90 




2.738e+03 


1.476e+02 




2.704e+03 


1.516e+02 




2.672O+03 


1.349C+02 




2.529e+03 


1.837e+02 



Ackley 


5 


0.70 


1.378e-08 


5.677e-09 


0.90 


6.3206-09 


2.966e-09 


0.95 


4.677e-09 


1.960e-09 


0.99 


5.1886-09 


2.8836-09 


fAck 


10 




2.074e-07 


9.033e-08 




9.5446-08 


3.422e-08 




9.396e-08 


3.5136-08 




5.8066-08 


2.6836-08 




30 




8.328e-06 


1.403e-06 




1.4836-05 


3.956e-06 




2.246e-05 


4.9576-06 




4.9766-05 


1.2986-06 




60 




1.019e-04 


2.396e-05 




8.2926-04 


2.097e-04 




1.897e-03 


9.1906-04 




3.2046-03 


1.3736-03 




90 




2.518e-04 


7.1676-05 




7.5446-04 


2.6686-04 




9.5716-02 


3.6096-01 




1.7416-01 


6.2906-01 


Griewangk 


5 


0.70 


1.525O-02 


1.387O-02 


0.90 


2.4636-02 


2.570e-02 


0.95 


1.574e-02 


1. 4110-02 


0.99 


1.285O-02 


1.801C-02 


fori 


10 




1.647O-02 


1.951O-02 




2.695O-02 


2.713e-02 




2.195e-02 


2.248O-02 




3.194O-02 


3.6S0C-02 




30 




2.012e-02 


2.3726-02 




1.8196-02 


1.664e-02 




2.321e-02 


3.8426-02 




2.2546-02 


1.8776-02 




60 




7.884e-03 


1.0616-02 




2.8086-02 


9.6866-02 




7.4106-03 


1.3216-02 




1.5826-02 


2.7276-02 




90 




7.391e-03 


7.6176-03 




6.248e-03 


e.741e-03 




8.938e-03 


1.1966-02 




1.2306-02 


2.3666-02 


Fletcher 




0.70 


1.523e+04 


1.506O+04 


0.90 


2.293e+04 


1.882O+04 


0.95 


1.286e + 04 


1.317e+04 


0.99 


1.527e+04 


1.362O+04 


SfIc 


10 




1.966e+04 


1.5856+04 




2.248e+04 


2.3006+04 




1.6336+04 


1.344e+04 




1.8916+04 


1.6126+04 




30 




2.1458+04 


1.6316+04 




2.1296+04 


1.3106+04 




3.0496+04 


2.3066+04 




2.4926+04 


1.9676+04 




60 




2.133e+04 


2.1106+04 




2.124e+04 


1.2136+04 




2.9356+04 


2.1656+04 




2.3746+04 


1.4796+04 




90 




2.432e+04 


2.273e+04 




2.898e+04 


3.1316+04 




2.918e+04 


2.418e+04 




3.453C+04 


2.498O+04 


Langerman 


5 


0.70 


-2.0640-01 


9.346C-02 


0.90 


-2.544e-01 


1.401e-01 


0.95 


-3.545e-01 


1.802e-01 


0.99 


-2.803C-01 


1.350C-01 


fLan 


10 




-2.339C-01 


1.280C-01 




-2.5S2e-01 


1.574e-01 




-2.663e-01 


1.247O-01 




-2.830C-01 


1.645O-01 


30 




-2.124C-01 


1.038O-01 




-2.191e-01 


l.lOOe-01 




-1.908e-01 


9.776O-02 




-2.382C-01 


1.572e-01 




60 




-1.975e-01 


1.4056-01 




-1.752e-01 


7.145e-02 




-1.762e-01 


8.9296-02 




-1.9496-01 


9.6006-02 




90 




-1.5996-01 


9.0576-02 




-1.3366-01 


6.0426-02 




-1.6666-01 


8.3366-02 




-1.7966-01 


8.4636-02 



Table 2: Average value and standard deviation of the 30 runs for each experiment 
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For determining whether there are significant differences among the various levels of 
a factor, we perform a multiple comparison test of the average fitness obtained with the 
different levels of each factor. First, we carry out a Levene test (Miller, 1996; Levene, 
1960) for evaluating the equality of variances. If the hypothesis that the variances are 
equal is accepted, we perform a Bonferroni test (Miller, 1996) for ranking the means of 
each level of the factor. Our aim is to find the level of each factor whose average fitness 
is significantly better than the average fitness of the rest of the levels of the factor. If 
the test of Levene results in rejecting the equality of covariance matrixes, we perform a 
Tamhane test (Tamhane k. Dunlop, 2000) instead of a Bonferroni test. Tables 9, 12, and 
13 in Appendix A show the results obtained following the above methodology. 

For Sphere function, the significant levels a* of each term of the linear model on Table 9 
show that none of the factors of the linear model has a significant effect on the model built 
to explain the variance of the fitness A. This effect is due to the fact that fsph is easy to 
optimize and the fitness behaves as a singular random variable with sample variance near 
0. We can see in Table 2 that the best results arc obtained with the pair (0.99,5). The 
multiple comparison test of Table 12 confirms that the means obtained with the value n = 5 
are significatively better than the means obtained with other values. In the same way, the 
average fitness for 1 — a = 0.70 is significantly the best one. The results show that, for any 
value of n, the best value of 1 — a, in general, is 1 — a = 0.70. Due to the simple form of 
fsph-, the best parameters of the crossover show a high exploitative component with a fast 
shifting towards the region of the best individuals. 

For the unimodal and non-separable functions fschDS and /ros, both factors and their 
interaction are significant in the linear model that explains the sample variance of A with 
a determination coefficient around 0.5. Table 2 shows that the best results are obtained 
with n = 5; the Tamhane test shows that the means obtained with this value of n are 
significatively better than the means obtained with other values. The results for the value 
of the confidence coefficient are less conclusive. In fact, for /^os there are no significant 
differences among the different values of 1 — a, although the best results are obtained with 
1 — a = 0.7. For fschDS the average fitness for /Lto.99 is the best one, but without significant 
differences with /io.ro- IJ-o.io together with n = 5 is the one that shows the best results. We 
can conclude that the feature of non-separability of the functions does not imply a notable 
change in the parameters of the crossover with respect to the parameters used for fsph- 

For fjias and fsch-, which are separable and multimodal, the most adequate pair of 
parameters is (0.70, 5). For fRas^ the test shows that the performance of this pair is signif- 
icantly better. However, for fsch^ the best mean is obtained with with results that are 
significantly better than these obtained with other values, with the exception of ji\Q. There 
are no significant differences among //o.70) /^o.95 and //go- The three factors of the linear 
model are significant with quite large determination coefficients of 0.617 for /^as and 0.805 
forfsch- This means that the factors and their interaction explain a high percentage of the 
variance of the fitness A. 

For fAck, the best results are obtained with the pair (0.95, 5). The Tamhane test confirms 
that n = 5 is the most suitable value, while there are no significant differences among /Lto.70) 
^0.95 and ^0.99- For fcri the best results are obtained with the pair (0.90,90). The test 
shows that large values of n are the most suitable for the optimization of this function. 
There are no significant differences among the performance of the different values of 1 — a. 
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For both functions the determination coefficient of the hnear model is low, showing that 
the linear model does not explain the variance of the fitness. The lack of a linear relation 
among n, 1 — a and the fitness makes it more difficult to determine the best value of the 
parameters of the crossover. 

The case of fpie and fLan is similar, as the linear model hardly gives any information 
about the effect of the parameters on the fitness. The most adequate pair for the optimiza- 
tion of these two functions is (0.95,5). The test shows that the best values of n are n = 5 
and n = 10. On the other hand, there are no significant differences among the performance 
of the crossover for the different values of 1 — a. 

The overall results show that the selection of the best n = 5 individuals of the population 
would suffice for obtaining a localization estimator good enough to guide the search process 
even for multimodal functions where a small value of n could favor the convergence to 
local optima. However, if the virtual parents have a worse fitness than the parent from the 
population, the offspring is generated near the latter, and the domain can be explored in 
multiple directions. In this way, the premature convergence to suboptimal virtual parents 
is avoided. 

However, if the best n individuals are concentrated in a local optimum the algorithm 
will very likely converge to such optimum. That is the reason why in complex functions a 
larger value of n may be reasonable, adding to the confidence interval individuals located in 
or near different optima. As an example of this, the case of fcri for which the best results 
are achieved with n = 90 and = 60 is noteworthy. 

The confidence coefficient bounds the error in the determination of the localization 
parameter and is responsible for focussing the search. The multiple comparison tests show 
that the value 1 — a = 0.70 is the best for 6 problems, and is, as least, no worse than the 
best one in the other problems. So it can be chosen as the most adequate value of the 
parameter. 

6. Compcirative Study of the Crossovers 

Due to the large amount of different crossovers available, it is unfeasible to make a com- 
prehensive comparison between all those crossovers and CIXL2. We have chosen those 
crossovers that obtain interesting results and those whose features are similar to our crossover, 
that is, which are self-adaptive and establish a balance between the exploration and the ex- 
ploitation of the search space. The way in which these two features arc balanced is regulated 
by one or more parameters of each crossover. These parameters have been chosen following 
the authors' recommendations and the papers devoted to the comparison of the different 
operators. 

The crossovers used in the comparison are: BLXa (Eshelman & Schaffer, 1993) with 
different degrees of exploration determined by the values a = {0.2,0.5} (Herrera et al., 
2003); fuzzy recombination (Voigt et al., 1995); based on fuzzy connectives of the logical 
family (logical crossover) (Herrera et al., 1998) using S2 strategies and A = 0.5 (Herrera & 
Lozano, 2000), SBX (Deb & Agrawal, 1995) using the values v = {2, 5} (Deb & Beyer, 2001); 
UNDX (Ono & Kobayashi, 1997) with cr^ = ^ and ar, = ^ (Kita, Ono, & Kobayashi, 1998; 
Kita, 2001). For CIXL2, as we have determined in the previous study, we will use n = 5 
and 1 - a = 0.70. 
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Following the setup of the previous study, we performed an ANOVA II analysis and 
a multiple comparison test. As might have been expected, keeping in mind the "no- free 
lunch" theorem and the diversity of the functions of the test set, the tests show that there is 
no crossover whose results are significatively better than the results of all other crossovers. 
This does not mean that these differences could not exist for certain kinds of functions. 
So, in order to determine for each kind of function whether a crossover is better than the 
others, we have performed an ANOVA I analysis — where the only factor is the crossover 
operator — and a multiple comparison test. Additionally, we graphically study the speed 
of convergence of the RCGA with regard to the crossover operator. In order to enforce the 
clearness of the graphics for each crossover, we show only the curve of the best performing 
set of parameters for BLX and SBX crossovers. 



Crossover 


Mean 


St. Dev. 


Mean 


St. Dev. 


Mean 


St. Dev. 




fSph 


fSchDS 


fplos 


CIXL2 


6.365e-16 


2.456e-16 


1.995e-03 


2.280e-03 


2.494e-|-01 


1.283e-|-00 


BLX(0.3) 


3.257e-16 


1.396C-16 


1.783C-02 


1.514e-02 


2.923e-|-01 


1.7236-1-01 


BLX(0.5) 


4. 737c- 16 


4.737C-16 


9.332C-03 


1.086e-02 


3.1616-1-01 


2.094C+01 


SBX(2) 


1.645C-12 


8.874C-13 


2.033C-01 


1.966C-01 


2.775C+01 


9.178C+00 


SBX(5) 


4.873C-12 


3.053C-12 


3.933C-01 


2.881C-01 


3.111C+01 


1.971C+01 


Ext. F. 


2.739C-15 


1.880C-15 


3.968c-f01 


1.760C+01 


2.743C+01 


1.394C+01 


Logical 


3.695e-13 


1.670e-13 


1.099e-|-01 


7.335e-|-00 


2.703e-|-01 


8.3586-02 


UNDX 


2.910e-05 


1.473e-05 


2.080e-|-01 


7.216e-t-00 


2.840e-t-01 


3.606e-01 



CIXL2 

BLX(0.3) 

BLX(0.5) 

SBX(2) 

SBX(5) 

Ext. F. 

Logical 

UNDX 

CIXL2 

BLX(0.3) 

BLX(0.5) 

SBX{2) 

SBX(5) 

Ext. F. 

Logical 

UNDX 



fRas 

2.919C+00 1.809C+00 
2.189e-|-00 1.4176-1-00 
3.0186-1-00 1.6836-1-00 



1.844e+01 
1.4196-1-01 
2.2456+01 
6.3256-1-01 
1.1076-F02 

fc, 
1.5256-02 
4.7496-02 

3.760C-02 
2.196C-02 
3.128C-02 
1.315e-03 
6.0786-03 
7.8376-02 



4.4176-1-00 
3.7046-1-00 
4.9146+00 
1.0126+01 
1.2426+01 

i 

1.3876-02 
4.5796-02 

2.874C-02 
1.874C-02 
2.737C-02 
3.470C-03 
6.457e-03 
4.4386-02 



fSch 

6.410C+02 2.544C+02 
3.695e+02 1.5956+02 
4.2006+02 1.9166+02 
1.4706+03 3.8276+02 
1.1046+03 3.3536+02 
3.0496+03 2.8766+02 
2.6296+03 9.7496+01 
8.0506+03 3.7416+02 

fpie 

1.523e+04 1.5066+04 
1.5706+04 



fAck 



1.8026+04 
3.263C+04 
3.333C+04 
1.691C+04 
2.718C+04 
3.4696+04 



1.5156+04 

1.483C+04 
3.110c 
2.973c 
1.446c 
1.388c 



04 
04 
04 
04 



2.1366+04 



1.378e-08 
4.2076-08 
6.4686-08 
5.3356-06 
9.6626-06 
1.7976-07 
2.5316-06 
3.5516-02 

fLan 

-2.0646-01 9. 
-3.0036-01 
-3.457e-01 

-1.939C-01 
-1.866C-01 
-1.064C-01 
-7.396C-08 
-2.1306-01 



677C-09 
7136-08 
9286-08 
4536-06 
3776-06 
8236-08 
1296-07 
2246-02 

3466-02 
3886-01 

684e-01 
086e-01 
080C-02 
517C-02 
218C-07 
1166-02 



Table 3: Average values and standard deviation for the 30 runs of every crossover operator. 



Table 3 shows the average values and standard deviations for the 30 runs performed for 
each crossover operator. Table 10 in Appendix A shows how, for all the functions, except 
/ros, the crossover operator has a significant effect on the linear model. The table also 
shows that the results of the Levene test indicate the inequality of the variances of the 
results of all the functions, excepting fpig. So, we use the Bonferroni test for fpie^ and the 
Tamhane test for all the others. The results of the multiple comparison test, the ranking 
established by the tests and the significant level of the differences among the results of the 
crossovers are shown on Tables 14, 15 and 16 (Appendix A). Figures 5 - 13, in Appendix 
B, show, in logarithmic scale, the convergence curves for each function. 
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For fsph the high value of the determination coefficient shows that the hnear model 
explains much of the variance of the fitness. The best values are obtained with BLX(0.3), 
BLX(0.5) and CIXL2, in this order. With these operators we obtain precisions around 
le-16. Figure 5 shows that CIXL2 is the fastest in convergence, but it is surpassed by BLX 
in the last generations. 

For fscliDS s-'^d Iros the best results are obtained with CIXL2. For fschDS the difference 
in performance with the other crossovers is statistically significant. For /ros the differences 
are significant, when CIXL2 is compared with Logical and UNDX. For fschDS the Figure 6 
shows how CIXL2 achieves a quasi-exponential convergence and a more precise final result. 
For /ros, in the Figure 7 we can see how the speed of convergence of CIXL2 is the highest, 
although the profile of all the crossovers is very similar with a fast initial convergence 
followed by a poor evolution due to the high epistasis of the function. The differences in 
the overall process are small. This fact explains that in the linear model the influence of 
the factor crossover is not significant and the determination coefficient is small. 

For fRas, BLX(0.3) again obtains the best results but without significant difference to the 
average values obtained with CIXL2 and BLX(0.5). These three operators also obtain the 
best results for fsch', however, the tests show that there are significant differences between 
CIXL2 and BLX(0.5), and that there are no differences between BLX(0.5) and BLX(0.3). 
The latter obtains the best results. Figures 8 and 9 show that BLX is the best in terms of 
convergence speed followed by CIXL2. The large value of means that the crossover has 
a significant influence on the evolutive process. 

For fAckj CIXL2 obtains significantly better results. In Figure 10 we can sec how it 
also converges faster. The large value of means that the crossover has a significant 
infiuence on the evolutive process. For fcri, the Fuzzy operator obtains significantly better 
results. The following ones, with significant differences between them, are Logical and 
CIXL2. Figure 11 shows a fast initial convergence of CIXL2, but in the end Logical and 
Fuzzy obtain better results. 

For fpif, the best results are obtained with CIXL2, but the difference is only significant 
with SBX and UNDX. Figure 12 shows that CIXL2 is the fastest in convergence, but with 
a curve profile similar to BLX and Fuzzy. For fLan, the best operator is BLX(0.5), with 
differences that are significant for all the other operators with the exception of BLX(0.3). 
UNDX and CIXL2 are together in third place. Figure 13 shows that the behavior of all 
crossovers is similar, except for the Logical crossover that converges to a value far from the 
other operators. 

7. Compcirison with Estimation of Distribution Algorithms 

ED As are evolutionary algorithms that use, as CIXL2, the best individuals of the population 
to direct the search. A comparison with this paradigm is interesting, although there are 
significant differences between EDAs and RCGAs. 

EDAs remove the operators of crossover and mutation. In each generation a subset of the 
population is selected and the distribution of the individuals of this subset is estimated. The 
individuals of the population for the next generation are obtained sampling the estimated 
distribution. Although any selection method could be applied, the most common one is the 
selection of the best individuals of the population. 
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The first EDAs were developed for discrete spaces. Later, they were adapted to con- 
tinuous domains. We can distinguish two types of EDAs, whctlicr they take into account 
dependencies between the variables or not. One of the most used among the EDAs that do 
not consider dependencies is U MDA^ (Univariate Marginal Distribution Algorithm for con- 
tinuous domains) (Larranaga, Etxeberria, Lozano, &; Peha, 2000). In every generation and 
for every variable the UMDAc carries out some statistical test in order to find the density 
function that best fits the variable. Once the densities have been identified, the estimation 
of parameters is performed by their maximum likelihood estimates. If all the distributions 
are normal, the two parameters are the mean and the standard deviation. This particular 
case will be denoted UMDAf (Univariate Marginal Distribution Algorithm for Gaussian 
models) . 

Among the other type of EDAs, we can consider EGNAbgc (Estimation of Gaussian 
Network Algorithm) (Larranaga et al., 2000) whose good results in function optimization 

are reported by Bcngoctxca and Miquelez (2002). In each generation, EGNAbgc learns 
the Gaussian network structure by using a Bayesian score that gives the same value for 
Gaussian networks reflecting the same conditional dependencies are used. Next, it calculates 
estimations for the parameters of the Gaussian network structure. 

In the experiments we have used the parameters reported by Bengoetxea and T. Miquelez 
(2002): a population of 2000 individuals, initialized using a uniform distribution, from which 
a subset of the best 1000 individuals are selected to estimate the density function, and the 
elitist approach was chosen (the best individual is included for the next population and 1999 
individuals are simulated). Each algorithm has been run 30 times with a stop criterion of 
300,000 evaluations of the fitness function. 

The results of EDAs are compared with the results of a RCGA with CIXL2 of parameters 
n = 5 and 1 — a = 0.70. We performed an ANOVA I analysis where the three levels of the 
factor are the different algorithms: RCGA with CIXL2, UMDAc and EGNAbgc- We also 
carried out a multiple comparison test. 

Table 4 shows the average values and standard deviations for 30 runs for each algorithm. 
Table 11 in Appendix A shows how, for all the functions excepting fAck-, the type of algo- 
rithm has a significant effect over the linear model and exist inequality of the variances of 
the results (Levene test). So, we have used Tamhane test for all the functions and Bonfcr- 
roni test for JacH- Table 17 (Appendix A) shows the results of the multiple comparison test 
and the ranking established by the test. 

For fsph the results are very similar. The fitness behaves as a singular random variable 
with sample variance near and the statistical tests are not feasible. 

For fschDS results of CIXL2 are significantly better than the results of U MDAc and 
EGNAsGe- The same situation occurs for /ros, fRas-, fsch and f'Ack, with the exception 
that in these four functions there are no significant differences between the two EDAs. For 
fGri, EGNAbgc and UMDAc achieve the best results, significantly better than CIXL2. 
For fpie, UMDAc is significantly better than EGNAbgc and CIXL2, but there are no 
differences between these two. For fLan, CIXL2 obtains the best results, but there are no 
significant differences among the three algorithms. 

The estimation of the distribution function of the best individuals of the population 
performed by EDAs is an advantage in fsph, unimodal and separable, and fcri and fAck 
whose optima are regularly distributed. The results of EDAs for fcri are better than 
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the results of CIXL2, but the results for f^ck are worse. The results for fsph of all the 
algorithms arc very similar. For non-separable unimodal functions, such as fschDS ^ind Jros, 
the interdependence among their variables should favor the performance of EGNAbgc over 
UMDAc and CIXL2. Nevertheless, CIXL2 achieves the best results for these two functions. 
For multimodal separable functions, fnas and fsch, it is difficult to identify the distribution 
of the best individuals and the performance of EDAs is below the performance of CIXL2. 

For extremely complex functions, such as fpie and fLan, the results are less conclusive. 
For fpig the best results are obtained with U MDAc, and there are no differences between 
EGNAbgb and CIXL2. For fLan, CIXL2 achieves the best results, but the differences 
among the three algorithms are not statistically significant. 



EA Mean St. Dev. Mean St. Dev. Mean St. Dev. 

fsph fschDS fRos 

CIXL2 6.365e-16 2.456e-16 1.995e-03 2.280e-03 2.494e-|-01 1.283e+00 

UMDAc 1.196e-16 1.713e-17 2.221e-|-01 3.900e+00 2.787e+01 2.278e-02 

EGNAgoe 1.077e-16 l.OOle-17 2.096e-01 1.189e-01 2.785e+01 1.629e-01 

fRas fsch fAck 

CIXL2 2.919e+00 1.809e+00 6.410e+02 2.544e-l-02 1.378e-08 5.677e-09 

UMDAc 1.576C+02 7.382c+00 1.153c+04 9.167c+01 2.478e-08 1.831e-09 

EGNAbgc 1.563C+02 8.525c+00 1.155c+04 8.754c+01 2.297e-08 2.095e-09 

fcri fpie fLan 

CIXL2 1.525C-02 1.387c-02 1.523c+04 1.506c+04 -2.064e-01 9.346c-02 

UMDAc 9.465C-16 1.207c-16 5.423e-|-03 1.562c+03 -1.734c-01 4.258c-ll 

EGNAsGe 8.200e-16 1.149e-16 9.069e+03 7.592e+03 -1.734e-01 1.864e-ll 

Table 4: Average values and standard deviation for the 30 runs of three evolutionary algo- 
rithms: RCGA with CIXL2 crossover, UMDA^ and EGNAbgc- 



8. Application to Artificial Intelligence 

Genetic algorithms have been applied to almost any kind of problem, such as, object recog- 
nition for artificial vision (Singh, Chatterjee, &; Chaudhury, 1997; Bebis, Louis, Varol, &: 
Yfantis, 2002), robotics path planing (Davidor, 1991; Sedighi, Ashenayi, Manikas, Wain- 
wright, k. Tai, 2004), parameter estimation (Johnson k. Husbands, 1990; Ortiz-Boyer, 
Hervas-Martmez, & Muiioz-Perez, 2003), instance selection (Cano, Hcrrcra, &: Lozano, 
2003; Kuncheva, 1995), reinforcement learning (Moriarty, Schultz, & Grefenstette, 1999), 
and neural network (Miller, Todd, k. Hedge, 1991; Andersen k. Tsoi, 1993; Bebis, Geor- 
giopoulos, k Kasparis, 1997) and ensemble design (Zhou, Wu, k Tang, 2002). 

Real-coded genetic algorithms using CIXL2 can be applied to any of these problems 
provided they are defined in a continuous domain. We have chosen an application of RCGAs 
to the estimation of the weight of each network in an ensemble. This is an interesting 
problem where standard methods encounter many difficulties. 

8.1 Estimation of the Weights of the Networks of an Ensemble 

Neural network ensembles (Perrone k Cooper, 1993) (Garcia-Pedrajas, Hervas-Martmez, 
k Ortiz-Boyer, 2005) are receiving increasing attention in recent neural network research, 
due to their interesting features. They are a powerful tool specially when facing complex 
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problems. Network ensembles are made up of a linear combination of several networks that 
have been trained using the same data, although the actual sample used by each network to 
learn can be different. Each network within the ensemble has a potentially different weight 
in the output of the ensemble. Several papers have shown (Perrone & Cooper, 1993) that 
the network ensemble has a generalization error generally smaller than that obtained with 
a single network and also that the variance of the ensemble is lesser than the variance of a 
single network. The output of an ensemble, y, when an input pattern x is presented, is: 



where yi is the output of network z, and Wi is the weight associated to that network. If the 
networks have more than one output, a different weight is usually assigned to each output. 
The ensembles of neural networks have some of the advantages of large networks without 
their problems of long training time and risk of over-fitting. 

Moreover, this combination of several networks that cooperate in solving a given task 
has other important advantages, such as (Liu, Yao, & Higuchi, 2000; Sharkey, 1996): 

• They can perform more complex tasks than any of their subcomponents. 

• They can make an overall system easier to understand and modify. 

• They are more robust than a single network. 

Techniques using multiple models usually consist of two independent phases: model 
generation and model combination (Merz, 1999b). Once each network has been trained and 
assigned a weights (model generation), there are, in a classification environment three basic 
methods for combining the outputs of the networks (model combination): 

1. Majority voting. Each pattern is classified into the class where the majority of net- 
works places it (Merz, 1999b). Majority voting is effective, but is prone to fail in two 



(a) When a subset of redundant and less accurate models comprise the majority, and 

(b) When a dissenting vote is not recognized as an area of specialization for a par- 
ticular model. 

2. Sum of the outputs of the networks. The output of the ensemble is just the sum of 
the outputs of the individual networks. 

3. Winner takes all. The pattern is assigned to the class with the highest output over all 
the outputs of all the networks. That is, the network with the largest outputs directly 
classify the pattern, without taking into account the other networks. 

The most commonly used methods for combining the networks are the majority voting 
and sum of the outputs of the networks, both with a weight vector that measures the 



k 




(15) 



1=1 



scenarios: 
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confidence in the prediction of each network. The problem of obtaining the weight vector 
a is not an easy task. Usually, the values of the weights ctj are constrained: 

N 

Y.oti = l, (16) 

1=1 

in order to help to produce estimators with lower prediction error (Leblanc & Tibshirani, 
1993), although the justification of this constraint is just intuitive (Breiman, 1996). When 
the method of majority voting is applied, the vote of each network is weighted before it is 
counted: 



F{x) = arg maXy Oj. (17) 

i:fi{x)=y 

The problem of finding the optimal weight vector is a very complex task. The "Basic 
ensemble method (BEM)" , as it is called by Pcrronc and Cooper (1993), consists of weighting 
all the networks equally. So, having A'" networks, the output of the ensembles is: 



1 ^ 



Perrone and Cooper (1993) defined the Generalized Ensemble Method, which is equiv- 
alent to the Mean Square Error - Optimal Linear Combination (MSE-OLC) without a 
constant term of Hashem (Hashem, 1997). The form of the output of the ensemble is: 



N 



fGEuix) = ^ aifi{x), (19) 



i=l 



where the a'^s are real and satisfy the constraint = 1. The values of are given 

by: 

a. = (20) 
Z^k l^j ^kj 

where Cij is the symmetric correlation matrix Cij = E[mi{x)mj{x)], where mk{x) defines 
the misfit of function k, that is the deviation from the true solution f{x), m,k{x) = f{x) — 
fkix). The previous methods are commonly used. Nevertheless, many other techniques 
have been proposed over the last few years. Among others, there arc methods based on 
linear regression (Leblanc & Tibshirani, 1993), principal components analysis and least- 
square regression (Merz, 1999a), correspondence analysis (Merz, 1999b), and the use of a 
validation set (Opitz & Shavlik, 1996). 

In this application, we use a genetic algorithm for obtaining the weight of each com- 
ponent. This approach is similar to the use of a gradient descent procedure (Kivinen & 
Warmuth, 1997), avoiding the problem of being trapped in local minima. The use of a 
genetic algorithm has an additional advantage over the optimal linear combination, as the 
former is not affected by the collinearity problem (Perrone & Cooper, 1993; Hashem, 1997). 
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8.1.1 Experimental Setup 

Each set of available data was divided into two subsets: 75% of the patterns were used 
for learning, and the remaining 25% for testing the generalization of the networks. There 
are two exceptions, Sonar and Vowel problems, as the patterns of these two problems are 
prearranged in two specific subsets due to their particular features. A summary of these 
data sets is shown in Table 5. No validation set was used in our experiments. 



Data set 


Cases 


Classes 


Features 


Inputs 


Train 


Test 


C 


B 


N 


Anneal 


674 


224 


5 


6 


14 


18 


59 


Autos 


154 


51 


6 


15 


4 


6 


72 


_1 J (Xi. CXii. 1 V -\ 


469 


156 


3 


4 






4 


Rrpast-ranrPT' 


215 


71 


2 




3 


6 


15 


Card 


518 


172 


2 


6 


4 


5 


51 




750 


250 


2 


6 


3 


11 


61 


Glass 


161 


53 


6 


9 






9 




226 


76 


2 


6 


3 


4 


22 


TTenatit.iR 

J. J-V^ LyC^U J. U J.O 


117 


38 


2 


6 


13 




19 


Horse 


273 


91 


3 


13 


2 


5 


58 


Ionosphere 


264 


87 


2 


33 


1 




34 


Iris 


113 


37 


3 


4 






4 


Labor 


43 


14 


2 


8 


3 


5 


29 


Liver 


259 


86 


2 


6 






2 


Lymphography 


111 


37 


4 




9 


6 


38 


Pima 


576 


192 


2 


8 






8 


Promoters 


80 


26 


2 






57 


114 


Segment 


1733 


577 


7 


19 






19 


Sonar 


104 


104 


2 


60 






60 


Soybean 


513 


170 


19 




16 


19 


82 


TicTacToe 


719 


239 


2 






9 


9 


Vehicle 


635 


211 


4 


18 






18 


Vote 


327 


108 


2 




16 




16 


Vowel 


528 


462 


11 


10 






10 


Zoo 


76 


25 


7 


1 


15 




16 



Table 5: Summary of data sets. The features of each data set can be C (continuous), 
B (binary) or N (nominal). The Inputs column shows the number of inputs of 
the network as it depends not only on the number of input variables but also on 
their type. 

These data sets cover a wide variety of problems. There are problems with different 
numbers of available patterns, from 57 to 2310, different numbers of classes, from 2 to 
19, different kinds of inputs, nominal, binary and continuous, and of different areas of 
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application, from medical diagnosis to vowel recognition. Testing our model on this wide 
variety of problems can give us a clear idea of its performance. These are all the sets to 
which the method has been applied. 

In order to test the efficiency of the proposed crossover in a classical artificial intelligence 
problem, we have used a RCGA to adjust the weight of each network within the ensemble. 
Our method considers each ensemble as a chromosome and applies a RCGA to optimize 
the weight of each network. The weight of each network of the ensemble is codified as a 
real number. The chromosome formed in this way is subject to CIXL2 crossover and non- 
uniform mutation. The parameters of CIXL2 are the same used in the rest of the paper, 
n = 5 and 1 — a = 0.7. The combination method used in the weighted sum of the outputs 
of the networks. Nevertheless, the same genetic algorithm could be used for weighting each 
network if a majority voting model is used. 

The exact conditions of the experiments for each run of all the algorithms were the 
following: 

• The ensemble was formed by 30 networks. Each network was trained separately using 
and standard back-propagation algorithm using the learning data. 

• Once the 30 networks have been trained, the different methods for obtaining the 
weights were applied. So, all the methods use the same ensemble of networks on each 
run of the experiment. For the genetic algorithm, the fitness of each individual of the 
population is the classification accuracy over the learning set. 

• After obtaining the vector of weights, the generalization error of each method is eval- 
uated using the testing data. 

Tables 6 and 7 show the results in terms of accurate classification for the 25 problems. 
The tables show the results using a RCGA with CIXL2, and the standard BEM and GEM 
methods. In order to compare the three methods we have performed a sign test over the 
win/draw/loss record of the three algorithms (Webb, 2000). These tests are shown in Table 
8. 

Table 8 shows the comparison statistics for the three models (Webb, 2000). For each 
model we show the win/draw/loss statistic, where the first value is the number of data sets 
for which col < row, the second is the number for which col = row, and the third is the 
number for which col > row. The second row shows the p-value of a two-tailed sign test 
on the win-loss record. The table shows that the genetic algorithm using CIXL2 is able 
to outperform the two standard algorithms BEM and GEM with a 10% confidence. On 
the other hand, there are no significant differences between BEM and GEM. This result 
is especially interesting because we have used a comprehensive set of problems from very 
different domains, different types of inputs, and different numbers of classes. 

9. Conclusions and Future Work 

In this paper we have proposed a crossover operator that allows the offspring to inherit 
features common to the best individuals of the population. The extraction of such common 
features is carried out by the determination of confidence intervals of the mean of the 



26 



CIXL2: A Crossover Operator for Evolutionary Algorithms 



Problem 



Anneal 



Autos 



Balance 



Breast 



Cancer 



Card 



German 



Glass 



Heart 



Hepa. 



Horse 



lonos. 



Iris 



Learning 



Test 





Mean 


St.Dev. 


Best 


Worst 


Mean 


St.Dev. 


Best 


Worst 




CIXL2 


0, 


.9933 


0.0046 


0, 


.9985 


0, 


.9777 


0, 


.9778 


0.0090 


0, 


.9911 


0, 


.9420 




BEM 


0, 


.9879 


0.0054 


0, 


.9955 


0, 


.9733 


0, 


.9729 


0.0091 


0, 


.9911 


0, 


.9464 


+ 


GEM 


0, 


.9915 


0.0054 


0, 


.9985 


0, 


.9777 


0, 


.9780 


0.0103 


0, 


.9911 


0, 


.9420 


- 


CIXL2 


0, 


.8957 


0.0233 


0, 


.9416 


0, 


.8506 


0, 


.7261 


0.0577 


0, 


.8235 


0, 


.5882 




BEM 


0, 


.8649 


0.0211 


0, 


.9091 


0, 


.8312 


0, 


.7052 


0.0586 


0, 


.8039 


0, 


.5686 


+ 


GEM 


0, 


.8740 


0.0262 


0, 


.9351 


0, 


.8182 


0, 


.7033 


0.0707 


0, 


.8039 


0, 


.5294 


+ 


CIXL2 


0, 


.9340 


0.0067 


0, 


.9446 


0, 


.9232 


0, 


.9201 


0.0118 


0, 


.9487 


0, 


.8910 




BEM 





.9179 


0.0068 


0, 


.9318 





.9019 


0, 


.9158 


0.0111 


0, 


.9423 


0, 


.8910 


+ 


GEM 


0, 


.9148 


0.0101 


0, 


.9318 


0, 


.8785 


0, 


.9158 


0.0110 


0, 


.9359 


0, 


.8910 


+ 


CIXL2 


0, 


.8575 


0.0195 


0, 


.8930 


0, 


.8047 


0, 


.6892 


0.0322 


0, 


.7465 


0, 


.6338 




BEM 


0, 


.8321 


0.0287 


0, 


.8698 


0, 


.7395 


0, 


.6826 


0.0375 


0, 


.7606 


0, 


.6056 


+ 


GEM 





.8274 


0.0314 


0, 


.8791 


0, 


.7488 





.6817 


0.0354 


0, 


.7324 


0, 


.6056 


+ 


CIXL2 


0, 


.9723 


0.0021 


0, 


.9771 


0, 


.9676 


0, 


.9799 


0.0065 


0, 


.9885 


0, 


.9655 




BEM 


0, 


.9678 


0.0034 


0, 


.9733 


0, 


.9600 


0, 


.9793 


0.0076 


0, 


.9943 


0, 


.9655 


+ 


GEM 


0, 


.9673 


0.0034 


0, 


.9733 


0, 


.9581 


0, 


.9785 


0.0084 


0, 


.9885 


0, 


.9598 


+ 


CIXL2 


0, 


.9201 


0.0087 


0, 


.9363 





.9054 


0, 


.8574 


0.0153 


0, 


.8895 


0, 


.8256 




BEM 


0, 


.9074 


0.0088 


0, 


.9247 


0, 


.8880 


0, 


.8521 


0.0212 


0, 


.8953 


0, 


.7965 


+ 


GEM 


0, 


.9049 


0.0093 


0, 


.9208 


0, 


.8822 


0, 


.8533 


0.0203 


0, 


.8953 


0, 


.7965 


+ 


CIXL2 


0, 


.8785 


0.0080 


0, 


.8973 


0, 


.8653 


0, 


.7333 


0.0184 


0, 


.7640 


0, 


.7000 




BEM 


0, 


.8587 


0.0090 


0, 


.8827 


0, 


.8440 


0, 


.7355 


0.0141 


0, 


.7600 


0, 


.7040 


- 


GEM 


0, 


.8642 


0.0099 


0, 


.8827 


0, 


.8427 


0, 


.7377 


0.0149 


0, 


.7680 


0, 


.7160 


- 


CIXL2 


0, 


.8509 


0.0225 


0, 


.9006 


0, 


.8075 


0, 


.6962 


0.0365 


0, 


.7736 


0, 


.6038 




BEM 


0, 


.8043 


0.0246 


0, 


.8447 


0, 


.7578 


0, 


.6824 


0.0424 


0, 


.7925 


0, 


.6038 


+ 


GEM 


0, 


.8246 


0.0293 


0, 


.8820 


0, 


.7640 


0, 


.6855 


0.0479 


0, 


.7736 


0, 


.6038 


+ 


CIXL2 


0, 


.9297 


0.0216 


0, 


.9653 


0, 


.8861 


0, 


.8358 


0.0271 


0, 


.8971 


0, 


.7794 




BEM 


0, 


.9089 


0.0214 


0, 


.9604 


0, 


.8663 


0, 


.8333 


0.0263 


0, 


.8824 


0, 


.7794 


+ 


GEM 


0, 


.9182 


0.0239 


0, 


.9554 


0, 


.8663 


0, 


.8279 


0.0312 


0, 


.8971 


0, 


.7794 


+ 


CIXL2 


0, 


.9385 


0.0224 


0, 


.9744 


0, 


.8718 


0, 


.8702 


0.0372 





.9211 


0, 


.8158 




BEM 


0, 


.9131 


0.0253 


0, 


.9573 


0, 


.8462 


0, 


.8658 


0.0319 


0, 


.9211 


0, 


.8158 


+ 


GEM 


0, 


.9179 


0.0289 


0, 


.9744 


0, 


.8376 


0, 


.8711 


0.0399 


0, 


.9474 


0, 


.7895 


- 


CIXL2 


0, 


.8723 


0.0174 


0, 


.9084 


0, 


.8315 


0, 


.7044 


0.0313 


0, 


.7692 


0, 


.6264 




BEM 


0, 


.8444 


0.0194 


0, 


.8718 


0, 


.7949 


0, 


.7000 


0.0301 


0, 


.7582 


0, 


.6374 


+ 


GEM 


0, 


.8485 


0.0207 


0, 


.8864 


0, 


.8095 


0, 


.7004 


0.0300 


0, 


.7802 


0, 


.6484 


+ 


CIXL2 


0, 


.9635 


0.0164 


0, 


.9886 


0, 


.9356 


0, 


.8950 


0.0225 


0, 


.9195 


0, 


.8276 




BEM 


0, 


.9481 


0.0171 


0, 


.9773 


0, 


.9167 


0, 


.8920 


0.0206 


0, 


.9195 


0, 


.8276 


+ 


GEM 


0, 


.9554 


0.0205 


0, 


.9886 


0, 


.9167 


0, 


.8958 


0.0198 


0, 


.9310 


0, 


.8621 




CIXL2 


1, 


.0000 


0.0000 


1, 


.0000 


1, 


.0000 


1, 


.0000 


0.0000 


1, 


.0000 


1, 


.0000 




BEM 


1, 


.0000 
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1, 
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1, 
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Table 6: Ensemble results using real-coded genetic algorithm (CIXL2), basic ensemble 
method (BEM), and generalized ensemble method (GEM). For each problem 
we have marked whichever CIXL2 is better (+), equal, (=), or worse (-) than 
BEM/GEM. 
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Table 7: Ensemble results using real-coded genetic algorithm (CIXL2), basic ensemble 
method (BEM), and generalized ensemble method (GEM). For each problem 
we have marked whichever CIXL2 is better (+), equal, (=), or worse (-) than 
BEM/GEM. 



28 



CIXL2: A Crossover Operator for Evolutionary Algorithms 



Algorithm 


BEM 


GEM 




CIXL2 


19/1/5 


17/1/7 


win/draw/loss 




0.0066 


0.0639 


p- value 


BEM 




9/4/12 


win/draw/loss 






0.6636 


p- value 



Table 8: Comparison of the three methods. Win/draw/loss record of the algorithms against 
each other and p- value of the sign test. 



best individuals of the population. From these confidence intervals, CIXL2 creates three 
virtual parents that are used to implement a directed search towards the region of the fittest 
individuals. The amplitude and speed of the search is determined by the number of best 
individuals selected and the confidence coefficient. 

The study carried out in order to obtain the best parameters for CIXL2 concludes that 
the value of n = 5 best individuals is suitable to obtain the localization estimator to guide 
the search in most of the problems tested. However, in very difficult problems, it would be 
advisable to have a larger value of n to avoid the premature convergence of the evolutionary 
process. The confident coefficient, 1 — a, is responsible, together with the dispersion of the 
best individuals, for the modulation of the wideness of the confidence interval centered on 
the localization estimator. The study results in the best value of 1 — a = 0.70. This pair of 
values has an acceptable performance for all problems, although there is not an optimum 
pair of values for all problems. 

The comparative analysis of the crossover operators shows that CIXL2 is a good alter- 
native to widely used crossovers such as BLX^ for unimodal function such as fsphj fschDS: 
and /ros- Noteworthy is the performance of CIXL2 in the two non-separable functions, 
fschDS ctnd /ros, where the other crossovers have a disparate behavior. 

If in unimodal functions the strategy of extracting the statistical features of localization 
and dispersion of the best individuals is a guarantee of good performance, the case for 
multimodal functions is quite different, and the performance of the algorithm is not assured 
a priori. Nevertheless, the results obtained for this kind of functions show that CIXL2 is 
always one of the best performing operators. For instance, in functions of a high complexity 
such as fAck — multimodal, non-separable and regular — and fpie — multimodal, non- 
separable and irregular — CIXL2 obtains the best results. This behavior reveals that 
the determination of the region of the best individuals by means of confidence intervals 
provides a robust methodology that, applied to crossover operator, shows an interesting 
performance even in very difficult functions. In summary, we can affirm that this paper 
proves that CIXL2 is a promising alternative to bear in mind, when we must choose which 
crossover to use in a real-coded genetic algorithm. 

EDAs have shown very good performance for unimodal and separable functions, fsph^ 
and for functions whose optima are regularly distributed, f'Ack and fcri- The performance 
of EDAs decreases in multimodal, fnas and fsch, and epistatic functions, fschDS and Jros- 
On the other hand, CIXL2 is less sensitive to the type of function. The main reason for this 
behavior may be found in the fact that CIXL2 uses the distribution information obtained 
from the best individuals of the population differently. CIXL2 creates three virtual parents 
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from this distribution, but if the virtual parents have worse fitness than the individual 
which mates, the offspring is not generated near these virtual parents. In this way, CIXL2 
prevents a shifting of the population to the confidence interval if the improvement of the 
performance is not significant. 

The applicability of the proposed crossover to a problem of artificial neural network 
ensembles shows how this model can be used for solving standard artificial intelligence 
problems. RCGAs with CIXL2 can also be used in other aspects of ensemble design, such 
as, selection of a subset of networks, and sampling of the training set of each network. 

These promising results motivate the beginning of a new line of research geared to the 
study of the distribution of the best individuals taking into account the kind of problem at 
hand. We aim to propose new techniques of selection of individuals to be considered for 
obtaining the confidence interval in a more reliable way. In multimodal, irregular, or with 
many chaotically scattered optima functions the difficulty of obtaining the distributions 
of the best individuals is enormous. In these kind of functions it would be interesting to 
perform a cluster analysis of the selected best individuals and to obtain a confidence interval 
for every cluster. This idea would allow the implementation of a multi-directional crossover 
towards different promising regions. 

On the other hand, it is likely that as the evolutive process progresses the distribution 
of the best individuals changes. In such a case, it would be advisable to perform, at regular 
intervals, statistical tests to determine the distribution that best reflects the features of the 
best individuals on the population. 

Alternatively, we arc considering the construction of non-parametric confidence intervals. 
In this way, we need more robust estimators of the parameters of localization and dispersion 
of the genes of the best individuals. We have performed some preliminary studies using the 
median and different measures of dispersion and the results are quite encouraging. 

Another research line currently open is the study of the application of CIXL2 to prob- 
lems of optimization with restrictions, especially in the presence of non-linearity, where 
the generation of individuals in the feasible region is a big issue. The orientation of the 
search based on the identification of the region of the best individuals that is implemented 
by CIXL2 could favor the generation of feasible individuals. This feature would be an 
interesting advantage with respect to other crossover operators. 
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Appendix A. Results of the Statistical Study 



Function 


a* C 


a* B 


a* CB 




a* T. Levene 


fSph 


1.000 


1.000 






0.000 


fSchDS 


0.000 


0.000 


0.000 


0.601 


0.000 


Iros 


0.005 


0.000 


0.006 


0.526 


0.000 


fRas 


0.000 


0.000 


0.000 


0.617 


0.000 


fSch 


0.000 


0.000 


0.000 


0.805 


0.000 


fAck 


0.095 


0.000 


0.019 


0.083 


0.000 


fCri 


0.149 


0.001 




0.040 


0.000 


fple 


0.410 


0.000 




0.054 


0.003 


fhan 


0.040 


0.000 


0.024 


0.159 


0.000 



Table 9: Significant levels, a*, of each term of the linear model, determination coefficient 
E?, and value of Levene test of the statistical analysis of CIXL2 parameters. 



Function 


(1* Crossover 




(I Levene test 


fsph 


0.000 


0.779 


0.000 


fSchDS 


0.000 


0.786 


0.000 


fRos 


0.573 


0.024 


0.000 


fRas 


0.000 


0.971 


0.000 


fSch 


0.000 


0.987 


0.000 


fAck 


0.000 


0.884 


0.000 


fOri 


0.000 


0.421 


0.000 


fple 


0.000 


0.137 


0.091 


fhan 


0.000 


0.486 


0.000 



Table 10: Significance level of the crossover operator and determination coefficient of the 
linear model, and value of Levene test of the comparative study of the crossovers. 



Function 


a* EA 




a* Levene test 


fSchDS 


0.000 


0.955 


0.000 


fRos 


0.000 


0.778 


0.000 


fRas 


0.000 


0.992 


0.000 


fSch 


0.000 


0.999 


0.000 


fAck 


1.000 


0.641 


1.000 


fori 


0.000 


0.455 


0.000 


fpie 


0.001 


0.150 


0.000 


fhan 


0.027 


0.079 


0.000 



Table 11: Significance level of the evolutionary algorithms and determination coefficient i?^ 
of the linear model, and value of Levene test of the comparative study betwen 
CIXL2 and EDAs. 
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I 


J 


Ml - MJ 


a* 


Ml - MJ 




a* 


Ml - MJ 


a* 






%ph 




%chDS 








6 


10 


-2.683e-15 


0.000 


-2.540e-02 




0.000 


-9.433e-01 


0.000 




30 


-4.144C-11 


0.000 


-1.899e-01 




0.000 


-1.486e+00 


0.000 




60 


-1.836C-07 


0.000 


-2.371c-01 




0.000 


-1.058e+00 


0.000 




90 


-5.554e-08 


0.000 


-1.004e+00 




0.000 


-8.37Be-01 


0.000 


10 


5 


2.683e-lB 


0.000 


2.540e-02 




0.000 


9.433e-01 


0.000 




30 


-4.144e-ll 


0.000 


-1.64Be-01 




0.000 


-B.42Be-01 


0.000 




60 


-1.836e-07 


0.000 


-2.117C-01 




0.000 


-1.142e-01 


0.025 




90 


-5.554C-08 


0.000 


-9.78.5c-()l 




().()()() 


1.058e-01 


0.014 


30 


5 


4.144e-ll 


0.000 


1.899e-01 




0.000 


1.486e+00 


0.000 




10 


4.144e-ll 


0.000 


1.645e-01 




0.000 


B.42Be-01 


0.000 




60 


-1.836e-07 


0.000 


-4.720e-02 




0.B72 


4.283e-01 


0.000 




90 


-5.549e-08 


0.000 


-8.140e-01 




0.000 


6.483e-01 


0.000 


60 


5 


1.836O-07 


0.000 


2.371C-01 




0.000 


1.058C+00 


0.000 




10 


1.836e-07 


0.000 


2.117e-01 




0.000 


1.142e-01 


0.025 




30 


1.83Be-07 


0.000 


4.720e-02 




0.B72 


-4.283e-01 


0.000 




90 


1.281e-07 


0.003 


-7.668e-01 




0.000 


2.200e-01 


0.000 


90 


5 


B.BB4e-08 


0.000 


1.004O+00 




0.000 


8.37,5o-01 


0.000 




10 


B.BB4e-08 


0.000 


9.785C-01 




0.000 


-l.()58o-01 


0.014 




30 


B.B49e-08 


0.000 


S.140e-01 




0.000 


-6.483O-01 


0.000 




60 


-1.281e-07 


0.003 


7.668e-01 




0.000 


-2.200e-01 


0.000 


Hanking 


M60 > M90 > M30 > MIO > M5 


M90 > M30 > 


MIO > M5 


M30 > M60 > MIO > M90 > M5 










M60 > 


M5 












fRas 




fSch 




f A 1 




5 


10 


-5.79e+00 


0.000 


-2.691e+02 




0.082 


1 n63*i 07 


0.000 




30 


-6.72e+00 


0.000 


-7.338e+02 




0.000 


-2.384e-0B 


0.000 




60 


-l.Ole+01 


0.000 


-9.S59e+02 




0.000 


-l.B08e-03 


0.000 




90 


-l.Slc+Ol 


0.000 


-1.148C+03 




0.000 


-6.769e-02 


0.216 


10 


6 


5.79O+00 


0.000 


2. 6910+02 




0.082 


1.063O-07 


0.000 




30 


-9.31e-()l 


0.807 


-4,fi47c + ()2 




().()(>() 


-2.373o-()5 


0.000 




(iO 


-4.:iii.' ■ (JO 


().()()() 


-(j.8Ij8.' ■ (12 




().()()() 


-l.">()Sc-():! 


o.ooo 




90 


-9.:i2c — uu 


IJ.IJIJIJ 


-8,781k; — (12 




o.ooo 


-(i.7(i9c>-()2 


(J.2ilj 


30 




6.72C+00 


0.000 


7.338C+02 




0.000 


2.384O-05 


0.000 




10 


9.31C-01 


0.807 


4.647O+02 




0.000 


2.373O-05 


0.000 




60 


-3.37e+00 


0.000 


-2.221e+02 




0.000 


-1.484e-03 


0.000 




90 


-8.39e+00 


0.000 


-4.139e+02 




0.000 


-6.767e-02 


0.216 


60 


5 


l.Ole+01 


0.000 


9.559e+02 




0.000 


1.508e-03 


0.000 




10 


4.30O+00 


0.000 


6.8680+02 




0.000 


I.5O80-O3 


0.000 




30 


3.37O+00 


0.000 


2.221c + ()2 




0.000 


1.484O-03 


0.000 




90 


-5.O2C+00 


0.000 


-1.918e + 02 




0.000 


-6.619O-02 


0.242 


90 


5 


I.5I0+OI 


0.000 


1.148C+03 




0.000 


6.769O-02 


0.216 




10 


9.320+00 


0.000 


8. 7860+02 




0.000 


6.769O-02 


0.216 




30 


8.39C+00 


0.000 


4. 1390+02 




0.000 


6.767O-02 


0.216 




60 


5.02C+00 


0.000 


I.9I80+O2 




0.000 


6. 6190-02 


0.242 


H,anking 


M90 > M60 > 


MIO > M5 


M90 > M60 > 


M30 > M5 


M60 > M30 > 


MIO > M5 






M30 > M5 


MIO > 


M5 


M90 > 


M5 


















6 


10 


-7.207E-03 


0.174 


-2.776O+03 




0.885 


-1.354e-02 


0.998 




30 


-3.896E-03 


0.864 


-7.968O+03 




0.004 


-5.88I0-O2 


0.009 




60 


2.329E-03 


1.000 


-7.342e+03 




0.008 


-8.794e-02 


0.000 




90 


8.649E-03 


0.001 


-1.268e+04 




0.000 


-1.142e-01 


0.000 


10 


5 


7.207B-03 


0.174 


2.776e+03 




0.88B 


1.3B4e-02 


0.998 




30 


3.311E-03 


0.983 


-5.192e+03 




0.234 


-4.527e-02 


0.082 




60 


9.535E-03 


0.833 


-4.566C+03 




0.378 


-7.44O0-O2 


0.000 




90 


1.586E-02 


0.000 


-9.899C+03 




0.006 


-1.007O-01 


0.000 


30 


5 


3.896E-03 


0.864 


7.968O+03 




0.004 


5.88I0-O2 


0.009 




10 


-3.311E-03 


0.983 


5.192O+03 




0.234 


4.527O-02 


0.082 




60 


6.225E-03 


0.930 


6.2,")4o+02 




1.000 


-2. 9130-02 


0.354 




90 


1.254E-02 


0.000 


~4.707c+03 




0.678 


-5.54O0-O2 


0.000 


60 


5 


-2.329E-03 


1.000 


7.342O+03 




0.008 


8. 7940-02 


0.000 




10 


-9.535E-03 


0.533 


4.566O+03 




0.378 


7.440e-02 


0.000 




30 


-6.225E-03 


0.930 


-6.254O+02 




1.000 


2.913e-02 


0.354 




90 


6.320E-03 


0.884 


-5.333e+03 




0.491 


-2.627e-02 


0.247 


90 


6 


-8.649E-03 


0.001 


1.268e+04 




0.000 


1.142e-01 


0.000 




10 


-1.S86E-02 


0.000 


9.899e+03 




0.006 


1.007e-01 


0.000 




30 


-1.254E-02 


0.000 


4.707e+03 




0.678 


B.B40e-02 


0.000 




60 


-6.320E-03 


0.884 


B.333e+03 




0.491 


2.627e-02 


0.247 


Hanking 


M60 > M90 


M6 > M90 


MIO > M5 


M30 > MS 


MIO > M5 


M30 > M5 






MIO > M90 


M30 > M90 


M60 > M5 


M90 > M5 


M60 > M5 


M90 > M5 



Table 12: Results for all the functions of the multiple comparison test and the ranking 
obtained depending on the number of best individuals n. 
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I 


J 


m - /iJ 


a* 


m - /"J 


a* 




* 

Cr 






fsph 




fschDS 


IRos 




0.70 


0.90 


-1.361e-08 


0.000 


-3.985C-01 


0.000 


-1.360C-01 


0.281 




0.95 


-4.394e-08 


0.000 


-3.783e-02 


0.967 


-1.693C-01 


0.131 




0.99 


-1.302e-07 


0.000 


8.165C-02 


0.114 


-1.813c-01 


0.310 


0.90 


0.70 


1.361C-08 


0.000 


3.985e-01 


0.000 


1.360C-01 


0.281 




0.95 


-3.033C-08 


0.000 


3.607C-01 


0.001 


-3.333C-02 


0.995 




0.99 


-1.166C-07 


0.000 


4.802C-01 


0.000 


A f^Q'i^ no 


u.yyo 


0.95 


0.70 


4.394e-08 


0.000 


3.783e-02 


0.967 


1.693e-01 


0.131 




0.90 


3.033e-08 


0.000 


-3.607e-01 


0.001 


3.333e-02 


0.995 




0.99 


-8.628e-08 


0.019 


1.195e-01 


0.013 


1 onn« no 


i.UUU 


0.99 


0.70 


1.302e-07 


0.000 


-8.165e-02 


0.114 


1.813e-01 


0.310 




0.90 


1.166e-07 


0.000 


-4.802e-01 


0.000 


4.533e-02 


0.996 




0.95 


8.628e-08 


0.019 


-1.195e-01 


0.013 


1 onrin no 


1 nnn 
i.UUU 


Ranking 


MO. 99 > MO. 95 > MO. 90 > MO. 70 


Mo. 90 > MO. 9.5 


> Mo. 99 


MO. 99 > MO. 95 > 


MO. 90 > MO. 70 










MO. 70 > MO. 99 










fRas 




fsch 




^Ack 




0.70 


0.90 


-4.23C+00 


0.000 


1.198e+02 


0.714 


-2.471C-04 


0.000 




0.95 


-3.59C+00 


0.000 


8.247e+01 


0.919 


-1.944C-02 


0.617 




0.99 


-5.56e+00 


0.000 


-3.008e+02 


0.001 


-3.541e-02 


0.382 


0.90 


0.70 


4.23e+00 


0.000 


-1.198e+02 


0.714 


2.471e-04 


0.000 




0.95 


6.40e-01 


0.966 


-3.736e+01 


0.997 


-1.919e-02 


0.631 




0.99 


-1.33e+00 


0.551 


-4.206e+02 


0.000 


-3.516e-02 


0.390 


0.95 


0.70 


3.59e+00 


0.000 


-8.247e+01 


0.919 


1.944e-02 


0.617 




0.90 


-6.40e-01 


0.966 


3.736e+01 


0.997 


1.919e-02 


0.631 




0.99 


-1.97C+00 


0.044 


-3.8.3,3c+02 


0.000 


-1.597e-02 


0.985 


0.99 


0.70 


5.56C+00 


0.000 


3.008C+02 


0.001 


3.541e-02 


0.382 




0.90 


1.33e+00 


0.551 


4.206e+02 


0.000 


3.516C-02 


0.390 




0.95 


1.97C+00 


0.044 


3.833e+02 


0.000 


1.597C-02 


0.985 


Ranking 


Mo. 99 > MO. 95 


> MO. 70 


MO. 70 > MO. 95 


> MO. 90 


Mo. 99 > MO. 95 > MO. 70 






MO. 90 > MO. 70 


MO. 99 > MO. 90 


MO. 90 > 


MO. 70 






fCri 




fpie 




fLan 




0.70 


0.90 


-7.196E-03 


0.395 


-2.986e+03 


0.717 


6.105e-03 


0.998 




0.95 


-2.027E-03 


0.945 


-3.241e+03 


0.635 


2.867e-02 


0.272 




0.99 


-5.667E-03 


0.155 


-3.079e+03 


0.644 


3.309e-02 


0.133 


0.90 


0.70 


7.196E-03 


0.395 


2.986e+03 


0.717 


-6.105e-03 


0.998 




0.95 


5.168E-03 


0.791 


-2.547e+02 


1.000 


2.257e-02 


0.585 




0.99 


1.529E-03 


1.000 


-9.255e+01 


1.000 


2.698e-02 


0.363 


0.95 


0.70 


2.027E-03 


0.945 


3.241e+03 


0.635 


-2.867C-02 


0.272 




0.90 


-5.168E-03 


0.791 


2.547e+02 


1.000 


-2.257C-02 


0.585 




0.99 


-3.640E-03 


0.747 


1.622e+02 


1.000 


4.415C-03 


1.000 


0.99 


0.70 


5.667E-03 


0.155 


3.079C+03 


0.644 


-3.309C-02 


0.133 




0.90 


-1.529E-03 


1.000 


9.255C+01 


1.000 


-2.698C-02 


0.,363 




0.95 


3.640E-03 


0.747 


-1.622e+02 


1.000 


-4.415e-03 


1.000 


Ranking 


MO. 90 > MO. 99 > MO. 95 > MO. 70 


MO. 95 > MO. 99 > MO. 90 > MO. 70 


MO. 70 > MO. 90 > 


MO. 95 > MO. 99 



Table 13: Results for all the functions of the multiple comparison test and the ranking 
obtained depending on the confidence coefficient 1 — a. 
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Crossover 



fSph 



I 


J 


Ml - Mj 




Ml - Mj 


o* 


Ml - Mj 


o * 


CIXL2 


BLX(0.3) 


3.1[)9c-16 


[).[)[)() 


-1.583C-02 


0.000 


-4.283C+00 


0.997 




BLX(0.5) 


1.628C-16 


0.212 


-7.337c-03 


0.028 


-6.667c+00 


0.933 




SBX(2) 


-1.644c-12 


0.000 


-2.014C-01 


0.000 


-2.809C+00 


0.958 




SBX(5) 


-4.873C-12 


0.000 


-3.913C-01 


0.000 


-6.165C+00 


0.944 






-2.102e-15 


0.000 


-3.968e+01 


0.000 


-2.487e+00 


1.000 




Logical 


-3.689e-13 


0.000 


-1.098e+01 


0.000 


-2.092e+00 


0.000 




UNDX 


-2.910e-05 


0.000 


-2.080e+01 


0.000 


-3.460e+00 


0.000 


BLX(0.3) 


CIXL2 


-3.109C-16 


0.000 


1.583e-02 


0.000 


4.283e+00 


0.997 




BLX(0.5) 


-1.48UC-16 


0.074 


8.495C-03 


0.357 


-2.384C+00 


1.000 




SBX{2) 


-1.644e-12 


0.000 


-1.855e-01 


0.000 


1.473e+00 


1.000 




SBX(5) 


-4.873e-12 


0.000 


-3.755e-01 


0.000 


-1.882e+00 


1.000 




Fuzzy 


-2.413e-15 


0.000 


-3.966e+01 


0.000 


l,796e+00 


1.000 




Logical 


-3.692C-13 


0.000 


-1.097C+01 


0.000 


2.191C+00 


1.000 




UNDX 


-2.91()c-()5 


0.000 


-2.()78e + ()l 


().()()() 


8.22,5o-01 


1.000 


BLX(0.5) 


CIXL2 


-1.628e-16 


0.212 


7.337e-03 


0.028 


6.667e+00 


0.933 




BLX(0.3) 


1.480e-16 


0.074 


-8.495e-03 


0,357 


2.384e+00 


1.000 




SBX(2) 


-1.644e-12 


0.000 


-1.940e-01 


0.000 


3.857e+00 


1.000 




SBX(5) 


-4.S73e-12 


0.000 


-3.840e-01 


0.000 


5.019e-01 


1.000 






-2.265C-15 


0.000 


^ Qfi7n-l-ni 
o . i C^ IJ ± 


().()()() 


4 1 7Qn-i-nn 


1 .000 




Logical 


-3.690e-13 


0.000 


-1.098e+01 


0.000 


4.575e+00 


1.000 




UNDX 


-2.910e-06 


0.000 


-2.079e+01 


0.000 


3.206e+00 


1.000 


SBX(2) 


CIXL2 


1.644e-12 


0.000 


2.014e-01 


0.000 


2.809e+00 


0.958 






1.644e-12 


0.000 


J- . OOOt5"\J J- 


0.000 




1.000 




BLX(0.5) 


1.644e-12 


0.000 


1.940e-01 


0.000 


-3.857e+00 


1.000 




SBX(5) 


-3.229e-12 


0.000 


-1.900e-01 


0.115 


-3.355e+00 


1.000 




Fuzzy 


1.642e-12 


0.000 


-3.948c+01 


0.000 


3.222C-01 


1.000 




Logical 


1.275e-12 


0.000 


-1.078C+01 


0.000 


7.179C-01 


1.000 




UNDX 


-2.910e-05 


0.000 


-2.060e+01 


0.000 


-6.508e-01 


1.000 


SBX(5) 


CIXL2 


4.873e-12 


0.000 


3.913e-01 


0.000 


6.165e+00 


0.944 




BLX{0.3) 


4.873e-12 


0.000 


3.755e-01 


0.000 


1.882e+00 


1.000 




BLX(0.5) 


4.873e-12 


0.000 


3.840C-01 


0.000 


-5.019C-01 


1.000 




SBX(2) 


3.229c-12 


().()()() 


1.900C-01 


0.115 


3.355c+00 


1.000 




Fuzzy 


4.871e-12 


0.000 


-3.929e+01 


0.000 


3.678e+00 


1.000 




Logical 


4.504e-12 


0.000 


-1.059e+01 


0.000 


4.073e+00 


1.000 




UNDX 


-2.910e-06 


0.000 


-2.041e+01 


0.000 


2.705e+00 


1.000 


Fuzzy 


CIXL2 


2.102O-15 


0.000 


3.968e+01 


0.000 


2.487C+00 


1.000 




BLX(0.3) 


2.4130-15 


0.000 


3.966e+01 


0.000 


-1.796O+00 


1.000 




BLX(0.5) 


2.265e-15 


0.000 


3.967e+01 


0.000 


-4.179e+00 


1.000 




SBX(2) 


-1.642e-12 


0.000 


3.948e+01 


0.000 


-3.222e-01 


1.000 




SBX(5) 


-4.871e-12 


0.000 


3.929e+01 


0.000 


-3.678e+00 


1.000 




Logical 


-3.668e-13 


0.000 


2.870e+01 


0.000 


3.957e-01 


1.000 




UNDX 


-2.910c-()5 


0.000 


1.888C+01 


().()()() 


-9.730e-01 


1.000 


Logical 


C1XL2 


3.689C-13 


0.000 


1.098e+01 


0.000 


2.092C+00 


0.000 




BLX(0.3) 


3.6920-13 


0.000 


1.097C+01 


0.000 


-2.191C+00 


1.000 




BLX(0.5) 


3.690O-13 


0.000 


1.098e+01 


0.000 


-4.575e+00 


1.000 




SBX(2) 


-1.275e-12 


0.000 


1.078e+01 


0.000 


-7.179e-01 


1.000 




SBX(B) 


-4.604e-12 


0.000 


1.069e+01 


0.000 


-4.073e+00 


1.000 




Fuzzy 


3.668e-13 


0.000 


-2.870e+01 


0.000 


-3.957e-01 


1.000 




UNDX 


-2.910C-05 


0.000 


-9.812C+00 


0.000 


-1.369C+00 


0.000 


UNDX 


C1XL2 


2.910C-05 


0.000 


2.080C+01 


0.000 


3.460C+00 


0.000 




BLX(0.3) 


2.910e-05 


0.000 


2.078e+01 


0.000 


-8.225e-01 


1.000 




BLX(0.5) 


2.910e-0B 


0.000 


2.079e+01 


0.000 


-3.206e+00 


1.000 




SBX{2) 


2.910e-05 


0.000 


2.060e+01 


0.000 


6.508e-01 


1.000 




SBX{5) 


2.910C-05 


0.000 


2.041C+01 


0.000 


-2.70,5o+00 


1.000 




Fuzzy 


2.910c-()5 


().()()() 


-1.888C + 01 


0.000 


9.730C-01 


1.000 




Logical 


2.910C-05 


0.000 


9.812e+00 


0.000 


1.369C+00 


0.000 



Function 


Ranking 




iSph 

fSchDS 

fRos 


MC/NDX > MSBX(5) > MSBX(2) > fJ-Logical > MEict.F. > I^CIXL2 > Mbi,X(0.5) > 
I^Ext.F. > Mt/ATDX > Miogioai > MSBX(5) S MSBX(2) > Mfli:,X(0.3) 2: MBj:,X(0.5) 
MBJ:,X(0.51 ^ MSBX(51 ^ MBI,X(0.31 ^ t^UNDX > f^SBX(2) > MEa;t.F. > f^Logical 


MbZ.X{0.3) 

> I^CIXL2 

> fCIXL2 



Table 14: Results of the multiple comparison tests for fsph, fschDS Y Iros functions and 
the ranking established by the test regarding the crossover operator. 
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Crossover 






fSch 




fAck 


I 


J 


Ml — Mj 


a * 


Ml — Mj 


a* 


Ml Mj_ 




CIXL2 


BLX(0.3) 


'V I'll 


0.923 


2 . 71 r)C-|-()2 


— TTniWi 


-2.830c-08 


o.ooo 




BLX(0.5) 




1.000 


z . z ii)c-\-i)z 


11 Ilnll 


-5.090c-08 


().(J()0 




SBX(2) 


- 1 .oozc-\-yi i 




OW^^i-LAO 

-o.^o / c-\-yiA 


■ 


-5.322c-06 


u.ooo 




SBX(5) 


- 1 . 1 Zoc-\-\j i- 


A AAA 

0.000 


-4.DoiC-|-Uz 


0.000 


-9.649c-06 


0.000 




Fuzzy 


-i.yooe-|-Ui 


0.000 


O Af\Q^ 1 AQ 


0.000 


-1.659e-07 


0.000 




Logical 


-D.Uooe-|-Ui 


0.000 


-i.yooe-|-uo 


0.000 


-2.517e-06 


0.000 




UNDX 


-1 n7Rp-i-n9 


0.000 


-7 d.nQp-1-ni 


0.000 


-3.550e-02 


0.000 


BLX(0.3) 


CIXL2 


- 1 . zi-JDC-Ui 


n 7?-^ 


O '71 1 AO 


0.000 


2.830c-08 


0.000 


BLX(0.5) 


-8.291 c- 1 




-o .UoUC-|-U i 




-2.261c-08 


0.000 




SBX{2) 




n nnn 


1 1 AAc-LAQ 

- 1. iuue-|-uo 




-5.293e-06 


0.000 




SBX(5) 


- J- . J- e-|- u -L 


n nnn 


7 OAit^ 1 AO 


n mn 


-9.620e-06 


0.000 




Fuzzy 


- J.U JDe-|-Ui 


A AAA 

0.000 


O flQAr. 1 AQ 


A AAA 

0.000 


-1.376e-07 


0.000 




Logical 


-D. ±UDC-|-Ui 




of;n^i-j_nQ 




O /I O O ^ Ai? 


0.000 




UNDX 


~1 .()85c+()2 


0.t)t)0 


- / . ij o ij ^ ij o 


0.000 


-3.550c-02 


0.0(30 


BLX(0.5) 


CIXL2 


n OKA^ AO 


1.000 


O O 1 A^ 1 AO 


0.010 


5.090e-08 


0.000 




BLX(0.3) 


Q om — A1 

o.^yie-ui 


0.713 


K AKA», 1 AI 

o.uoue-|-ui 


1.000 


2.261e-08 


0.000 




SBX(2) 


1 g /I 1 A1 

- i.o4ze-|-ui 




1 A K A„ 1 AO 




-5.271e-06 


0.000 




SBX(5) 


1 11 1 A1 

- i . i ioe-|-Ui 


A AAA 

0.000 


Q A 1 ^ 1 AO 


A AAA 

0.000 


-9.598e-06 


0.000 




Fuzzy 


1 A /I ^1 _L A 1 




(^OO^i-LA'^ 




-1.150c-07 


0.000 




Logical 


~D.UzaC-|-UI 


A AAA 

0.000 


O OrflA.-^ 1 rflQ 

-Z.ZUi)C-\-U6 


000 


-2.466c- 06 


0.000 




UNDX 


-1 .()77c+()2 


0.000 


- * . U O U IK) 


0.000 


-3.55t)c-t)2 


0.000 


tlizi-v /'ri\ 

SBX(2) 


CIXL2 


i.oozC-|-Ui 


0.000 


O.ZO i C-f-UZ 


0.000 


5.322C-0G 


0.000 




BLX(0.3) 


1 fjocTf- 1 m 
± . Dzoe-|-ui 


^■^^^ 


1 1 nnQ-LA'? 


^■^^^ 


5.293e-06 


0.000 




BLX(0.5) 


i.o4ze-i-Ul 


0.000 


1 AEA^ 1 AO 

i.UoUe+Uo 


0.000 


5.271e-06 


0.000 




SBX(5) 


4. z4oe-|-UU 


0.005 


O fiKK^ 1 AO 


0.006 


-4.327e-06 


0.000 




Fuzzy 


A m Qi-._LAA 
-4.Ul0C-|-UU 




1 t:'7A„ 1 AO 




5.156e-06 


0.000 




Logical 


/I /I Q 1 ^ 1 A1 


0.000 


1 1 A 1 1"! O 


0.000 


O OAC^ Ai? 


0.000 




UNDX 


Q 997o-l_ni 


0.000 




0.000 


-3.550e-02 


0.000 


SBX(5) 


CIXL2 


1 1 OO^ 1 A1 


0.000 


A ei'31 ^ 1 AO 


0.000 


9.649e-06 


0.000 




BLX(0.3) 


1 oni ^ I ni 


0.000 


T 0/1C^ 1 AO 

( .o4De-|-Uz 


0.000 


9.620e-06 


0.000 




BLX(0.5) 


1 11 1 A1 

i . i ioc-|-Ui 




C Q /I 1 ^ 1 AO 


0.000 


9.598c-06 


0.000 




SBX(2) 


A 0/1I^^i_LAA 




q «c:e„_lao 
-O.0OOC-|-Uii 




4.327C-06 


0.000 




Fuzzy 


O O C O ^ 1 A A 

-o.2ooe-|-UU 


0.000 


1 A ^ C ^ 1 AO 

-i.y4oe+Uo 


0.000 


9.483e-06 


0.000 




Logical 


A flAK^ 1 AI 

-4.yuoe-|-ui 


0.000 


1 KOK^ 1 AO 


0.000 


7.iJ2e-UD 


0.000 






-Q fi'll p-uni 


0.000 




0.000 


-0. 00U6-U^ 




Fuzzy 


CIXL2 


1 Qc;'^*j_i_ni 
± . y^jot;— r u X 


0.000 


2.408c+03 


0.000 


1.659c-07 


0.000 




BLX(0.3) 


2.026C+01 


0.000 


2.fS8{)c + ()3 


0.000 


1.37(ic-()7 


0.000 




BLX(0.5) 


1.943e+01 


0.000 


2.629e+03 


0.000 


1.150e-07 


0.000 




SBX(2) 


4.013e+00 


0.042 


1.579e+03 


0.000 


-B.lB6e-06 


0.000 




SBX(5) 


8.2586+00 


0.000 


1.945e+03 


0.000 


-9.483e-06 


0.000 




Logical 


-4.079e+01 


0.000 


4.199e+02 


0.000 


-2.351e-06 


0.000 




UNDX 


-S.S26C+01 


0.000 


-5.001c+a3 


0.000 


-3.55()c-02 


0.000 


Logical 


CIXL2 


6.033C+01 


0.000 


1.98Sc+a3 


0.000 


2.517C-06 


0.000 




BLX(0.3) 


6.106C+01 


0.000 


2.260C+03 


0.000 


2.4SSC-06 


0.000 




BLX{0.5) 


6.023e+01 


0.000 


2.209e+03 


0.000 


2.466C-06 


0.000 




SBX(2) 


4.481e+01 


0.000 


1.159e+03 


0.000 


-2.805e-06 


0.000 




SBX(5) 


4.905e+01 


0.000 


1.525e+03 


0.000 


-7.132e-06 


0.000 




Fuzzy 


4.079e+01 


0.000 


-4.199e+02 


0.000 


2.351e-06 


0.000 




UNDX 


-4.746e+01 


0.000 


-5.421e+03 


0.000 


-3. 5500-02 


0.000 


UNDX 


CIXL2 


1.078C+02 


0.000 


7.409C+03 


0.000 


3.550C-02 


0.000 




BLX(0.3) 


1.085e+02 


0.000 


7.680e+03 


0.000 


3.550e-02 


0.000 




BLX(0.5) 


1.077e+02 


0.000 


7.630e+03 


0.000 


3.550e-02 


0.000 




SBX(2) 


9.227e+01 


0.000 


6.580e+03 


0.000 


3.660e-02 


0.000 




SBX{5) 


9.651C+01 


0.000 


6.946C+03 


0.000 


3.550C-02 


0.000 




Fuzzy 


8.826c+()l 


().()()() 


r).()()lc+03 


0.000 


3.550C-02 


o.ooo 




Logical 


4.746e+01 


0.000 


5.421C+03 


0.000 


3.550e-02 


0.000 


Function 








Ranking 








fRas 


fJ-UNDX 


> y-Logical > fJ-E. 


xt.F. > (^SBX{2) > f^SBX{5) 


> f^BLX(Q.6) 


> P'CIXL2 > 




fSch 




> y-Ext.F. > fJ-Lo 


gical ^ 


f^SBX(2) > f^SBX(5) > fJ-CIXL2 > MBI.X(0.5) > 




fAck 


fJ'UNDX 


> f^SBX(5) > f^SBX(2) > t^Logical > fJ'E^nt.F. 


> I^BLX(0.5) > f^BLX(0.3) 


> fCIXL2 



Table 15: Results of the multiple comparison tests for fuas, fsch and fAck functions and 
the ranking established by the test regarding the crossover operator. 
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Crossover 






fpl 




fLan 




I 


J 


f^I — Mj 


a* 


Ml — Mj 


a* 


Ml — Mj 




CIXL2 


BLX(0.3) 


-3 . 224c-{)2 


0.021 


-4. / l\iC-\-\lJ, 


— TTuw\ 

1.000 


;J.oo4c-Uz 


0.091 




BLX(0.5) 


-2 . 23r)C-()2 


0.012 


-J,. 1 ci\)C-\-\]'A 




1 0<i I'll 


0.007 




SBX(2) 


-D. / iUc-Uo 


0.973 


1 T At\f-. 1 t\ A 
- i . / 4UC-|-U4 


0.034 


1 o o no 


1.000 




SBX(5) 


- i .OUoC-UZ 


0. 167 


- i .o iUc-f-U4 


?'nnn 


1 nQo», no 


1.000 




Fuzzy 




0.000 


1 CQC^ 1 nQ 
-i.DoDe-|-Uo 


1.000 


1 nnn^ m 
-i.UUUe-Ui 


0.000 




Logical 


y.i / oe-uo 


0.057 


-J..iyDe-|-U4 


0.709 


O C\£iAr^ ni 


0.000 




UNDX 


-6.312e-02 


0.000 




0.009 


T_> . UU * c-OO 


1.000 


BLX(0.3) 


CIXL2 


1 oo/i,^ no 
o.zz4c-Uz 




A '7'7Cir-. 1 no 
4. i i\jc-\-\)J, 




-:^.oo4c-Uz 


? nnn 


BLX(0.5) 




1.000 


O Q11^ 1 t\'i 

- J.dl ic-|-Uo 


1 .000 


4. 54{)c-{)2 






SBX{2) 


o '^'^'^£1 no 




- i.Dyoe-|-U4 


n no^ 


- 1 . UD4e-U-L 


046 




SBX(5) 


1 £!oi^ no 


0.952 


1 1 n/1 
-1. ( Doe-|-U4 


0.029 


-1. 137e-01 


0.013 




Fuzzy 


A o ^ no 


0.000 


1 ono^ 1 no 


1.000 


1 no o ^ m 
-l.yooe-Ui 


0.000 




Logical 


A 1 A o^ no 


0.001 


1 1 /I 1 n /I 




1 nno», ni 






UNDX 


-3.U88c-(J2 


0.2.32 




0.012 


-8.728c-t)2 


t) . 1 5 1 


BLX(0.5) 


CIXL2 


o OQc:-, no 


0.012 


o 'vacir-. 1 no 


1.000 


1 'I n o ^ n 1 


0.007 




BLX(0.3) 


n QnQ^ HQ 

-y.csyoe-ud 


1.000 


o Qi 1 1 no 


1.000 


A K.Af\r^ no 
-4. o4Ue-U^ 


1.000 




SBX(2) 


1 K(KAi^ no 




1 /I fll " 1 n/i 


n i^o? 


-1 . 518e-01 






SBX(5) 


a Qon^ nQ 


1.000 


1 cr o 1 ^ 1 n/1 




1 Km ni 
- 1 . oy le-Ul 


0.001 




Fuzzy 






1 . lU4c-|-Uo 


1 nnn 


o '^qoq m 






Logical 


3. 152c-02 


000 


-y . io9c-|-Uo 


1 000 


O AKTr-. ni 


0.000 




UNDX 


-4.()77c-()2 


t).003 


-1 .668c+()4 


0.054 


-1.327c- 01 


0.012 


tlizi-v /'ri\ 

SBX(2) 


CIXL2 


O. ( iUc-Uo 


0.973 


i . / 4Uc-|-U4 


0.034 




1.000 




BLX(0.3) 


O c;cq„ no 




i.oyoe-|-U4 




1 nft/ic ni 
i.U04e-ui 






BLX(0.5) 


1 A ^ no 


0.361 


i.4Die-|-U4 


0.179 


1.518e-01 


0.004 




SBX(5) 


n Qon^ nQ 


0.980 


*7 nno^ 1 no 


1.000 


•7 oo no 


1.000 




Fuzzy 


O nfir:,, no 


0.000 


1 cyo^ 1 n/1 


0.095 


Q '7'i'7^ no 


0.008 




Logical 


1.588c- 02 


0.003 


f, A ACir-. \ n'i 


1 .000 


1 non», ni 
-l.yoye-Ul 


0.000 




UNDX 




0.000 




1.000 


1 QnQp-n9 


1.000 


SBX(5) 


CIXL2 


1 /3nQ^ no 


0.167 


1 o 1 n« 1 n A 

i.esiUe-|-U4 


0.022 


1 noOrt no 


1.000 




BLX(0.3) 


1 fioi ^ no 


0.952 


L. / Doe-|-U4 


0.029 


1. 137e-01 


0.013 




BLX(0.5) 


oon^ no 


1.000 


1 c o 1 ^ 1 n /i 


0.121 


1 .591c-01 


0.001 




SBX(2) 






'7 nno^ 1 no 




7. 285c-(J3 






Fuzzy 


o nnT^ no 

j.yy f e-u^ 


0.000 


1 ^ o ^ 1 n /i 
i.D4Je-|-U4 


0.063 


o n 1 o ^ no 


0.004 




Logical 


o Kon^ no 


0.001 


(3 1/1 eir, 1 no 
D. i4oe-|-Ui3 


1.000 


1 oatir^ ni 


0.000 




UNDX 




0.000 




1.000 




1.000 


Fuzzy 


CIXL2 


-1.394C-02 


0.000 


1.686c+03 


1.000 


l.OOOc-01 


0.000 




BLX(0.3) 


-4.61SC-02 


0.000 


1.2080+03 


1.000 


1.938C-01 


0.000 




BLX(0.5) 


-3.629e-02 


0.000 


-1.104e+03 


1.000 


2.392e-01 


0.000 




SBX(2) 


-2.066e-02 


0.000 


-1.572e+04 


0.095 


8.747e-02 


0.008 




SBX(5) 


-2.997e-02 


0.000 


-1.642e+04 


0.063 


8.018e-02 


0.004 




Logical 


-4.763e-03 


0.025 


-1.027e+04 


1.000 


-1.064e-01 


0.000 




UNDX 


-7.706C-02 


0.000 


-1.778C+04 


0.027 


1.066C-01 


0.000 


Logical 


CIXL2 


-9.173C-03 


0.057 


1.196C+04 


0.709 


2.064C-01 


0.000 




BLX(0.3) 


-4.142e-02 


0.001 


1.148C+04 


0.888 


3.003e-01 


0.000 




BLX(0.5) 


-3.152e-02 


0.000 


9.169C+03 


1.000 


3.457e-01 


0.000 




SBX(2) 


-1.588e-02 


0.003 


-5.446e+03 


1.000 


1.939e-01 


0.000 




SBX(5) 


-2.520e-02 


0.001 


-6.146e+03 


1.000 


1.866e-01 


0.000 




Fuzzy 


4.763e-03 


0.025 


1.027e+04 


1.000 


1.064e-01 


0.000 




UNDX 


-7.229e-02 


0.000 


-7.507e+03 


1.000 


2.130e-01 


0.000 


UNDX 


CIXL2 


6.312e-02 


0.000 


1.947C+04 


0.009 


-6.557e-03 


1.000 




BLX(0.3) 


3.088e-02 


0.252 


1.899e+04 


0.012 


8.728e-02 


0.151 




BLXC0.5) 


4.077e-02 


0.003 


1.668e+04 


0.054 


1.327e-01 


0.012 




SBX(2) 


5.641e-02 


0.000 


2.061e+03 


1.000 


-1.909e-02 


1.000 




SBX{5) 


4.709O-02 


0.000 


1.361C+03 


1.000 


-2.637O-02 


1.000 




Fuzzy 


7.7()6o-02 


0.000 


1.778C+04 


0.027 


-1.0(i(ic-01 


0.000 




Logical 


7.229O-02 


0.000 


7.507C+03 


1.000 


-2.130e-01 


0.000 


Function 








Ranking 









iGri IIUNDX > MBtx(0.3) > I^BLX(0.5) > MSBX(5) > MSBX{2) > IJ-CIXL2 > /^Logical > MExt.F. 

fpie IIUNDX > MSBX(5) S MSBX(2) > I^Logioal > MBiX(0.5) ^ MEit.F. > I^BLX^O.3) ^ liCIXL2 

than IJ-Logical > fBxt.F. > fSBX(5) ^ MSBX(2) > l^-C I X L2 > l^-UNDX > MBI,X(0.3) ^ MBI,X(0.5) 



Table 16: Results of the multiple comparison tests for fcri, fpie and fLan functions and 
the ranking established by the test regarding the crossover operator. 
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I 


J 


MI - MJ 


a* 


MI - MJ 


a* 


MI - MJ "* 


Ml - MJ 


a* 






fSchDS 


fRos 




fRas 


fSch 




CIXL2 


UMDAc 


-2.221e+01 


0.000 


-2.928e+00 0.000 


-1.547e+02 0.000 


-1.089e+04 


0.000 




EGNABGe 


-2.076e-01 


0.000 


-2.906e+00 0.000 


-1.533e+02 0.000 


-1.091e+04 


0.000 


UMDAc 


CIXL2 


2.221C+01 


0.000 


2.928C+00 


0.000 


1.547C+02 0.000 


1.089e+04 


0.000 




EGNAbgc 


2.200C+01 


0.000 


2.207C-02 


0.856 


1..360C+00 0.888 


-2.390e+01 


0.677 


EGNAbgc CIXL2 


2.076e-01 


0.000 


2.906C+00 


0.000 


1.533C+02 0.000 


1.091C+04 


0.000 




UMDAc 


-2.200e+01 


0.000 


-2.207e-02 


0.856 


-1.360e+00 0.888 


2.390e+01 


0.677 


Function 










Ranking 






fSchDS 










> liEGN Ag(^c > P'CIXL2 






Iros 








f UMDAc 


> liBGN AgQc > P'CIXL2 






fitas 








I'UMDAc 


> I^EGNAgQc > IJ'CIXL2 






fSch 








t^EGN Afinr — I^UMDAc > I^CIXL2 










fAck 




fori 




fpie 


fLan 




CIXL2 


UMDAc 


-l.lOlc-08 


0.000 


1.525C-02 


0.000 


9.803C+03 0.004 


-3.306C-02 


0.176 




EGNABGe 


-9.194C-09 


0.000 


1.525C-02 


0.000 


6.157C+03 0.150 


-3.306C-02 


0.176 


UMDAc 


CIXL2 


l.lOlc-08 


0.000 


-1.525C-02 


0.000 


-9.803C+03 0.004 


3.306C-02 


0.176 




EGNABGe 


1.817C-09 


0.175 


1.266C-16 


0.000 


-3.646C+03 0.049 


1.33781C-11 


0.325 


EGNABGe 


CIXL2 


9.194e-09 


0.000 


-1.525e-02 


0.000 


-6.157e+03 0.150 


3.306e-02 


0.176 




UMDAc 


-1.817e-09 


0.175 


-1.266e-16 


0.000 


3.646e+03 0.049 


-1.33781e-ll 


0.325 


Function 










Hanking 






fAck 








liUMDAc 


> IJ'BGN AgQi, > l'CIXL2 






fOri 








IJ'CIXL2 > f^UMDAa > MEGNAeGe 






fple 








liCIXL2 > MEGJVAgCj, > Mt/MDAo 






fLan 










> IJ^BGN Afscir — I^CIXL2 







Table 17: Results for all the functions of the multiple comparison test and the ranking 
obtained depending on the evolutionary algorithm. 
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Appendix B. Convergence Graphics 
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Figure 5: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function fsph- 
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Figure 6: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function fschDS- 
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Figure 7: Evolution of the averaged fitness, in logarithmic scale, using different crossover 
operators for the function /ros- 
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Figure 8: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function fnas- 
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Figure 9: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function fsch- 
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Figure 10: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function fAck- 
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Figure 11: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function fori- 
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Figure 12: Evolution of the average, in logarithmic scale, using different crossover operators 
for the function fpie- 



41 



ORTIZ-BOYER, HERVAS-MARTI'nEZ, & GARCfA-PEDRAJAS 



0.01 



D 0.0001 



le-10 





















CIXL2(0.70,5) 






BLX(0.3) . 






SBX(5J 












Logical 






UNDX 



50000 100000 150000 200000 250000 300000 



Figure 13: Evolution of the average fitness, in logarithmic scale, using different crossover 
operators for the function /loh- 
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